Abstract. In this paper, we describe a new method to compute the minimum of a real
polynomial function and the ideal defining the points which minimize this polynomial function,
assuming that the minimizer ideal is zero-dimensional. Our method is a generalization
of the Lasserre relaxation method and stops in a finite number of steps. The proposed algorithm
combines Border Basis, Moment Matrices and Semidefinite Programming. In the case
where the minimum is reached at a finite number of points, it provides a border basis of the
minimizer ideal.
1. Introduction



Optimization appears in many areas of Scientific Computing, since the solution of a problem 
can often be described as the minimum of an optimization problem. Local methods such as 
gradient descent are often employed to handle global minimization problems. They can be 
very efficient to compute a local minimum, but the output depends on the initial guess and 
they give no guarantee of a global solution. 

In the case where the function $f$ to minimize is a polynomial, it is possible to develop
methods which ensure the computation of a global solution. Reformulating the problem as
the computation of a (minimal) critical value of the polynomial $f$, different polynomial system
solvers can be used to tackle it (see e.g. [27], [8]). But in this case, the complex solutions
of the underlying algebraic system come into play and additional computational effort must
be spent to remove these extraneous solutions. Semi-algebraic techniques such as Cylindrical
Algebraic Decomposition or extensions [29] may also be considered here but suffer from similar
issues. Though the global minimization problem is known to be NP-hard (see e.g. [23]), a
practical challenge is to devise methods which take into account "only" the real solutions of
the problem or which can approximate them efficiently.

Previous works. About a decade ago, a relaxation approach was proposed in [14] (see
also [26], [31]) to solve this difficult problem. Instead of searching for points where the polynomial
$f$ reaches its minimum $f^*$, one searches for a probability measure which minimizes the function $f$.
This problem is relaxed into a hierarchy of finite-dimensional convex minimization problems,
which can be solved by Semi-Definite Programming (SDP) techniques, and which converges
to the minimum $f^*$ [14]. This hierarchy of SDP problems can be formulated in terms of
linear matrix inequalities on moment matrices associated to the set of monomials of degree
$\le t \in \mathbb{N}$ for increasing values of $t$. The dual hierarchy can be described as a sequence of
maximization problems over the cone of polynomials which are Sums of Squares (SoS). A
feasibility condition is needed to prove that this dual hierarchy of maximization problems also
converges to the minimum $f^*$, i.e. that there is no duality gap.

From a computational point of view, this approach suffers from two drawbacks: 

(1) the hierarchy of optimization problems may not be exact, i.e. it may not always
converge in a finite number of steps;

(2) the size of the SDP problems to be solved grows exponentially with t. 


MARTA ABRIL BUCERO, BERNARD MOURRAIN, PHILIPPE TREBUCHET 



To address the first issue, the following strategy has been considered: add polynomial
inequalities or equalities satisfied by the points where the function $f$ is minimum. Inequality
constraints can for instance be added to restrict the optimization problem to a compact subset
of $\mathbb{R}^n$ and to make the hierarchies exact [14], [19]. Natural constraints which do not require
a priori bounds on the solutions are for instance the vanishing of the partial derivatives of
$f$. A result in [17] shows that if the gradient ideal (generated by the first differentials of $f$)
is zero-dimensional, then the hierarchy extended with constraints from the gradient ideal is
exact. It is also proved that there is no duality gap if "good" generators of the gradient ideal
are used. In [25], it is proved that the extended hierarchy is exact when the gradient ideal is
radical, and in [24] the extended hierarchy is proved to be exact for any polynomial $f$ when
the minimum is reached in $\mathbb{R}^n$ (see also [9]). In [30], the relaxation techniques are analyzed for
functions for which the minimum is not reached and which satisfy some special properties
"at infinity".

From an algorithmic point of view, this does not end the investigation, since a criterion is
needed to determine at which step the minimum is reached. It is also important to know whether
all the minimizers can be recovered at that step.

Methods based on moment matrices have been proposed to compute generators of the ideal
characterizing the real solutions of a polynomial system. In [16], the computation of generators
of the (real) radical of an ideal is based on moment matrices which involve all monomials of
a given degree, and a stopping criterion related to the Curto-Fialkow flat extension condition [5]
is used. This method is improved in [13]. Combining the border basis algorithm of [21]
and a weaker flat extension condition [18], a new algorithm which involves SDP problems of
significantly smaller size is proposed to compute the (real) radical of an ideal, when this (real)
radical ideal is zero-dimensional. In [25], an algorithm is also proposed to compute the global
minimum of the polynomial function $f$ based on techniques from [17]. It terminates when the
gradient ideal is radical zero-dimensional.

An interesting feature of these hierarchies of SDP problems is that, at any step, they provide
a lower bound of $f^*$, and the SoS hierarchy gives certificates for these lower bounds (see e.g.
[12] and references therein). In [10], [11] it is also shown how to obtain "good" upper bounds
by perturbation techniques, which can directly be generalized to the approach we propose in
this paper.

Contributions. We show that if the minimum $f^*$ is reached in $\mathbb{R}^n$, a generalized hierarchy of
relaxation problems which involve the gradient ideal $I_{grad}(f)$ is exact and yields the generators
of the minimizer ideal $I_{min}(f)$ from a sufficiently high degree.

In the case where the minimizer ideal is zero-dimensional, we give a criterion for deciding
when the minimum is reached, based on the flat extension condition in [18].




This criterion is used in a new algorithm which computes a border basis of the minimizer 
ideal of a polynomial function, when this minimizer ideal is zero-dimensional. 

The algorithm is an extension of the real radical algorithm described in [13]. The rows and
columns of the matrices involved in the semi-definite programming problem are associated
with the family of monomials that are candidates for being a basis of the quotient space, i.e., a subset
of monomials of size much smaller than the number of monomials of the same degree. Thus,
the size of the SDP problems involved in this computation is significantly smaller than the
one in [U], [J?] or [23]. 

We show that by solving this sequence of SDP problems, we obtain in a finite number
of steps the minimum $f^*$ of $f$, with no duality gap. When this minimum is reached, the
kernel of the Hankel matrix associated to the solution of the SDP problem yields generators
of the minimizer ideal which are not in the gradient ideal. Assuming that the minimizer ideal
$I_{min}(f)$ is zero-dimensional, computing the border basis of this kernel yields a representation
of the quotient algebra by $I_{min}(f)$ and thus a way to compute effectively the minimizer points,
using eigenvector solvers.

An implementation of this method has been developed, which integrates a border basis 
implementation and a numerical SDP solver. It is demonstrated on typical examples. 

Content. The paper is organized as follows. Section 2 recalls algebraic concepts such as
ideals, varieties, dual space and quotient algebra, the definitions and theorems about
border bases, and the Hankel operators involved in the computation of (real) radical ideals
and in the computation of our minimizer ideal. In Section 3, we describe the main results
on the exactness of the hierarchy of SDP problems on truncated Hankel operators and show
that the minimizer ideal can be computed from the kernel of these Hankel operators. In Section 4, we
analyze more precisely the case where the minimizer ideal is zero-dimensional. In Section 5,
we describe our algorithm and we prove its correctness. Finally, in Section 6, we illustrate the
algorithm on some classical examples.

2. Ideals, dual space, quotient algebra and border basis 

In this section, we set our notation and recall the eigenvalue techniques for solving polyno- 
mial equations and the border basis method. 

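2.1. Ideals and varieties. Let $\mathbb{K}[\mathbf{x}]$ be the set of the polynomials in the variables
$\mathbf{x} = (x_1, \ldots, x_n)$, with coefficients in the field $\mathbb{K}$. Hereafter, we will choose${}^{1}$
$\mathbb{K} = \mathbb{R}$ or $\mathbb{C}$. Let $\overline{\mathbb{K}}$ denote the algebraic closure of $\mathbb{K}$. For $\alpha \in \mathbb{N}^n$,
$\mathbf{x}^{\alpha} = x_1^{\alpha_1} \cdots x_n^{\alpha_n}$ is the monomial with exponent $\alpha$ and degree
$|\alpha| = \sum_i \alpha_i$. The set of all monomials in $\mathbf{x}$ is denoted $\mathcal{M} = \mathcal{M}(\mathbf{x})$.
We say that $\mathbf{x}^{\alpha} \le \mathbf{x}^{\beta}$ if $\mathbf{x}^{\alpha}$ divides $\mathbf{x}^{\beta}$, i.e., if $\alpha \le \beta$ coordinate-wise. For a polynomial
$f = \sum_{\alpha} f_{\alpha} \mathbf{x}^{\alpha}$, its support is $supp(f) := \{\mathbf{x}^{\alpha} \mid f_{\alpha} \neq 0\}$, the set of monomials occurring with
a nonzero coefficient in $f$.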

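For $t \in \mathbb{N}$ and $S \subseteq \mathbb{K}[\mathbf{x}]$, we introduce the following sets:

• $S_t$ is the set of elements of $S$ of degree $\le t$,

• $\langle S \rangle = \{ \sum_{f \in S} \lambda_f f \mid f \in S, \lambda_f \in \mathbb{K} \}$ is the linear span of $S$,

• $(S) = \{ \sum_{f \in S} p_f f \mid p_f \in \mathbb{K}[\mathbf{x}], f \in S \}$ is the ideal in $\mathbb{K}[\mathbf{x}]$ generated by $S$,

• $\langle S \mid t \rangle = \{ \sum_{f \in S_t} p_f f \mid p_f \in \mathbb{K}[\mathbf{x}]_{t - \deg(f)} \}$ is the vector space spanned by
$\{ \mathbf{x}^{\alpha} f \mid f \in S_t, |\alpha| \le t - \deg(f) \}$,

• $S^{+} := S \cup x_1 S \cup \ldots \cup x_n S$ is the prolongation of $S$ by one degree,

• $\partial S := S^{+} \setminus S$ is the border of $S$,

• $S^{[t]} := S^{+ \cdots +}$ ($t$ times) is the result of applying $t$ times the prolongation operator on $S$,
with $S^{[1]} = S^{+}$ and, by convention, $S^{[0]} = S$.

Therefore, $S_t = S \cap \mathbb{K}[\mathbf{x}]_t$, $S^{[t]} = \{ \mathbf{x}^{\alpha} f \mid f \in S, |\alpha| \le t \}$,
$\langle S \mid t \rangle \subseteq (S) \cap \mathbb{K}[\mathbf{x}]_t = (S)_t$, but the
inclusion may be strict.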

If $B \subseteq \mathcal{M}$ contains 1 then, for any monomial $m \in \mathcal{M}$, there exists an integer $k$ for which
$m \in B^{[k]}$. The $B$-index of $m$, denoted by $\delta_B(m)$, is defined as the smallest integer $k$ for which
$m \in B^{[k]}$.

A set of monomials $B$ is said to be connected to 1 if $1 \in B$ and for every monomial $m \neq 1$
in $B$, $m = x_{i_0} m'$ for some $i_0 \in [1, n]$ and $m' \in B$.
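
The set operations above are easy to experiment with on monomial sets. The following Python sketch is only an illustration under stated assumptions, not the implementation used in the paper: a monomial is encoded by its exponent tuple, and the helper names `prolong`, `border`, `b_index` and `connected_to_1` are hypothetical choices made here.

```python
def prolong(S, n):
    """S^+ : S together with x_i * m for every m in S and i = 1..n."""
    out = set(S)
    for m in S:
        for i in range(n):
            out.add(m[:i] + (m[i] + 1,) + m[i + 1:])
    return out


def border(S, n):
    """Border of S: the monomials of S^+ that are not in S."""
    return prolong(S, n) - set(S)


def b_index(m, B, n):
    """Smallest k with m in B^[k]; assumes 1 (the zero exponent) is in B."""
    k, current = 0, set(B)
    while m not in current:
        current = prolong(current, n)
        k += 1
    return k


def connected_to_1(B, n):
    """True if 1 is in B and every m != 1 in B equals x_i * m' with m' in B."""
    one = (0,) * n
    if one not in B:
        return False
    for m in B:
        if m == one:
            continue
        if not any(m[i] > 0 and m[:i] + (m[i] - 1,) + m[i + 1:] in B
                   for i in range(n)):
            return False
    return True


# B = {1, x, y, xy} in two variables, encoded by exponents.
B = {(0, 0), (1, 0), (0, 1), (1, 1)}
print(sorted(border(B, 2)))    # exponents of the border monomials of B
print(b_index((3, 0), B, 2))   # B-index of x^3
print(connected_to_1(B, 2))
```

For this choice of B, the border consists of the exponents of x^2, y^2, x^2 y and x y^2, and x^3 only appears after two prolongations.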

Given a vector space $E \subseteq \mathbb{K}[\mathbf{x}]$, its prolongation $E^{+} := E + x_1 E + \ldots + x_n E$ is again a vector
space.

The vector space $E$ is said to be connected to 1 if $1 \in E$ and any non-constant polynomial
$p \in E$ can be written as $p = p_0 + \sum_{i=1}^{n} x_i p_i$ for some polynomials $p_0, p_i \in E$ with $\deg(p_0) \le \deg(p)$,
$\deg(p_i) \le \deg(p) - 1$ for $i \in [1, n]$.



${}^{1}$ For notational simplicity, we will consider only these two fields in this paper, but $\mathbb{R}$ and $\mathbb{C}$ can be replaced
respectively by any real closed field and any field containing its algebraic closure.
Obviously, $E$ is connected to 1 when $E = \langle C \rangle$ for some monomial set $C \subseteq \mathcal{M}$ which is
connected to 1. Moreover, $E^{+} = \langle C^{+} \rangle$ if $E = \langle C \rangle$.

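Given an ideal $I \subseteq \mathbb{K}[\mathbf{x}]$ and a field $\mathbb{L} \supseteq \mathbb{K}$, we denote by

$V_{\mathbb{L}}(I) := \{ x \in \mathbb{L}^n \mid f(x) = 0 \ \forall f \in I \}$

its associated variety in $\mathbb{L}^n$. By convention $V(I) = V_{\overline{\mathbb{K}}}(I)$. For a set $V \subseteq \mathbb{K}^n$, we define its
vanishing ideal

$I(V) := \{ f \in \mathbb{K}[\mathbf{x}] \mid f(v) = 0 \ \forall v \in V \}$.

Furthermore, we denote by

$\sqrt{I} := \{ f \in \mathbb{K}[\mathbf{x}] \mid f^m \in I \text{ for some } m \in \mathbb{N} \setminus \{0\} \}$

the radical of $I$.

For $\mathbb{K} = \mathbb{R}$, we have $V(I) = V_{\mathbb{C}}(I)$, but one may also be interested in the subset of real
solutions, namely the real variety $V_{\mathbb{R}}(I) = V(I) \cap \mathbb{R}^n$. The corresponding vanishing ideal is
$I(V_{\mathbb{R}}(I))$ and the real radical ideal is

$\sqrt[\mathbb{R}]{I} := \{ p \in \mathbb{R}[\mathbf{x}] \mid p^{2m} + \sum_j q_j^2 \in I \text{ for some } q_j \in \mathbb{R}[\mathbf{x}], m \in \mathbb{N} \setminus \{0\} \}$.

Obviously,

$I \subseteq \sqrt{I} \subseteq I(V_{\mathbb{C}}(I)), \quad I \subseteq \sqrt[\mathbb{R}]{I} \subseteq I(V_{\mathbb{R}}(I))$.

An ideal $I$ is said to be radical (resp., real radical) if $I = \sqrt{I}$ (resp. $I = \sqrt[\mathbb{R}]{I}$). Obviously,
$I \subseteq I(V(I)) \subseteq I(V_{\mathbb{R}}(I))$. Hence, if $I \subseteq \mathbb{R}[\mathbf{x}]$ is real radical, then $I$ is radical and moreover,
$V(I) = V_{\mathbb{R}}(I) \subseteq \mathbb{R}^n$ if $|V_{\mathbb{R}}(I)| < \infty$.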

The following two famous theorems relate vanishing and radical ideals: 

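Theorem 2.1.

(i) Hilbert's Nullstellensatz (see, e.g., [4, §4.1]): $\sqrt{I} = I(V_{\mathbb{C}}(I))$ for an ideal $I \subseteq \mathbb{C}[\mathbf{x}]$.

(ii) Real Nullstellensatz (see, e.g., [H §4.1]): $\sqrt[\mathbb{R}]{I} = I(V_{\mathbb{R}}(I))$ for an ideal $I \subseteq \mathbb{R}[\mathbf{x}]$.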

2.2. The roots from the quotient algebra structure. Given an ideal $I \subseteq \mathbb{K}[\mathbf{x}]$, the
quotient set $\mathbb{K}[\mathbf{x}]/I$ consists of all cosets $[f] := f + I = \{ f + q \mid q \in I \}$ for $f \in \mathbb{K}[\mathbf{x}]$, i.e.,
all equivalence classes of polynomials of $\mathbb{K}[\mathbf{x}]$ modulo the ideal $I$. The quotient set $\mathbb{K}[\mathbf{x}]/I$
is an algebra with addition $[f] + [g] := [f + g]$, scalar multiplication $\lambda [f] := [\lambda f]$ and with
multiplication $[f][g] := [fg]$, for $\lambda \in \mathbb{K}$, $f, g \in \mathbb{K}[\mathbf{x}]$.

We will say that an ideal $I$ is zero-dimensional if $0 < |V_{\overline{\mathbb{K}}}(I)| < \infty$. Then $\mathbb{K}[\mathbf{x}]/I$ is a finite-dimensional
vector space and its dimension is the number of roots counted with multiplicity
(see e.g. [3], [6]). Thus $|V_{\overline{\mathbb{K}}}(I)| \le \dim_{\mathbb{K}} \mathbb{K}[\mathbf{x}]/I$, with equality if and only if $I$ is radical.

Assume that $0 < |V_{\overline{\mathbb{K}}}(I)| < \infty$ and set $N := \dim_{\mathbb{K}} \mathbb{K}[\mathbf{x}]/I$ so that $|V_{\overline{\mathbb{K}}}(I)| \le N < \infty$.
Consider a set $B := \{b_1, \ldots, b_N\} \subseteq \mathbb{K}[\mathbf{x}]$ for which $\{[b_1], \ldots, [b_N]\}$ is a basis of $\mathbb{K}[\mathbf{x}]/I$; by
abuse of language we also say that $B$ itself is a basis of $\mathbb{K}[\mathbf{x}]/I$. Then every $f \in \mathbb{K}[\mathbf{x}]$ can
be written in a unique way as $f = \sum_{i=1}^{N} c_i b_i + p$ where $c_i \in \mathbb{K}$, $p \in I$; the polynomial
$\pi_{I,B}(f) := \sum_{i=1}^{N} c_i b_i$ is called the remainder of $f$ modulo $I$, or its normal form, with respect
to the basis $B$. In other words, $\langle B \rangle$ and $\mathbb{K}[\mathbf{x}]/I$ are isomorphic vector spaces.

Given a polynomial $h \in \mathbb{K}[\mathbf{x}]$, we can define the multiplication (by $h$) operator as

(1) $\mathcal{M}_h : \mathbb{K}[\mathbf{x}]/I \to \mathbb{K}[\mathbf{x}]/I, \quad [f] \mapsto \mathcal{M}_h([f]) := [hf]$.

Assume that $N := \dim_{\mathbb{K}} \mathbb{K}[\mathbf{x}]/I < \infty$. Then the multiplication operator $\mathcal{M}_h$ can be
represented by its matrix, again denoted $\mathcal{M}_h$ for simplicity, with respect to a given basis
$B = \{b_1, \ldots, b_N\}$ of $\mathbb{K}[\mathbf{x}]/I$.
Namely, setting $\pi_{I,B}(h b_j) := \sum_{i=1}^{N} a_{ij} b_i$ for some scalars $a_{ij} \in \mathbb{K}$, the $j$th column of $\mathcal{M}_h$ is
the vector $(a_{ij})_{i=1}^{N}$.

Define the vector $\zeta_{B,v} := (b_j(v))_{j=1}^{N} \in \overline{\mathbb{K}}^N$, whose coordinates are the evaluations of the
polynomials $b_j \in B$ at the point $v \in \overline{\mathbb{K}}^n$. The following famous result (see e.g., [3, Chapter
2§4], [6]) relates the eigenvalues of the multiplication operators in $\mathbb{K}[\mathbf{x}]/I$ to the algebraic
variety $V(I)$. This result underlies the so-called eigenvalue method for solving polynomial
equations [7] and plays a central role in many algorithms, also in the present paper.

Theorem 2.2. Let $I$ be a zero-dimensional ideal in $\mathbb{K}[\mathbf{x}]$, $B$ a basis of $\mathbb{K}[\mathbf{x}]/I$, and $h \in \mathbb{K}[\mathbf{x}]$.
The eigenvalues of the multiplication operator $\mathcal{M}_h$ are the evaluations $h(v)$ of the polynomial $h$
at the points $v \in V(I)$. Moreover, $(\mathcal{M}_h)^T \zeta_{B,v} = h(v) \zeta_{B,v}$ and the common eigenvectors
of $(\mathcal{M}_h)_{h \in \mathbb{K}[\mathbf{x}]}$ are, up to a non-zero scalar multiple, the vectors $\zeta_{B,v}$ for $v \in V(I)$.

Throughout the paper we also denote by $\mathcal{M}_i := \mathcal{M}_{x_i}$ the matrix of the multiplication
operator by the variable $x_i$. By the above theorem, the eigenvalues of the matrices $\mathcal{M}_i$ are
the $i$th coordinates of the points $v \in V(I)$. Thus the task of solving a system of polynomial
equations is reduced to a task of numerical linear algebra once a basis of $\mathbb{K}[\mathbf{x}]/I$ and a normal
form algorithm are available, permitting the construction of the multiplication matrices $\mathcal{M}_i$.
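
As a small illustration of Theorem 2.2, the following Python sketch checks the eigenvector relation on a univariate toy example chosen here (it does not come from the paper): the ideal generated by x^2 - 3x + 2, with basis {1, x} of the quotient. The helper names `transpose_times` and `eigenvalues_2x2` are hypothetical, and the closed-form 2x2 eigenvalue formula stands in for a general numerical eigensolver.

```python
import math

# Matrix of multiplication by x in R[x]/(x^2 - 3x + 2) in the basis B = {1, x}:
# x*1 = x        -> first column  (0, 1)
# x*x = 3x - 2   -> second column (-2, 3)
M_x = [[0, -2],
       [1, 3]]


def transpose_times(M, v):
    """Return M^T v for a square matrix M given as a list of rows."""
    n = len(M)
    return [sum(M[i][j] * v[i] for i in range(n)) for j in range(n)]


def eigenvalues_2x2(M):
    """Real eigenvalues of a 2x2 matrix from its characteristic polynomial."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    disc = math.sqrt(tr * tr - 4 * det)  # assumes a non-negative discriminant
    return sorted([(tr - disc) / 2, (tr + disc) / 2])


roots = eigenvalues_2x2(M_x)          # eigenvalues of M_x = points of V(I)
for v in roots:
    zeta = [1.0, v]                   # evaluations of the basis (1, x) at v
    lhs = transpose_times(M_x, zeta)  # M_x^T zeta
    rhs = [v * c for c in zeta]       # h(v) * zeta with h = x
    print(v, lhs, rhs)
```

In several variables the same relation holds for each matrix $\mathcal{M}_i$, so the coordinates of the roots can be read off from common eigenvectors.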

2.3. Border bases. The eigenvalue method for solving polynomial equations from the above
section requires the knowledge of a basis of $\mathbb{K}[\mathbf{x}]/I$ and an algorithm to compute the normal
form of a polynomial with respect to this basis. In this section we will recall a general method
for computing such a basis and a method to reduce polynomials to their normal form.
Throughout, $B \subseteq \mathcal{M}$ is a finite set of monomials.

Definition 2.3. A rewriting family $F$ for a (monomial) set $B$ is a set of polynomials
$F = \{f_i\}_{i \in \mathcal{I}}$ such that

• $supp(f_i) \subseteq B^{+}$,

• $f_i$ has exactly one monomial in $\partial B$, denoted as $\gamma(f_i)$ and called the leading monomial
of $f_i$. (The polynomial $f_i$ is normalized so that the coefficient of $\gamma(f_i)$ is 1.)

• if $\gamma(f_i) = \gamma(f_j)$ then $i = j$.

Definition 2.4. We say that the rewriting family $F$ is graded if $\deg(\gamma(f)) = \deg(f)$ for all
$f \in F$.

Definition 2.5. A rewriting family $F$ for $B$ is said to be complete in degree $t$ if it is graded
and satisfies $(\partial B)_t \subseteq \gamma(F)$, that is, each monomial $m \in \partial B$ of degree at most $t$ is the leading
monomial of some (necessarily unique) $f \in F$.

Definition 2.6. Let $F$ be a rewriting family for $B$, complete in degree $t$. Let $\pi_{F,B}$ be the
projection on $\langle B \rangle$ along $F$ defined recursively on the monomials $m \in \mathcal{M}_t$ in the following way:

• if $m \in B_t$, then $\pi_{F,B}(m) = m$,

• if $m \in (\partial B)_t$ $(= (B^{[1]} \setminus B^{[0]})_t)$, then $\pi_{F,B}(m) = m - f$, where $f$ is the (unique)
polynomial in $F$ for which $\gamma(f) = m$,

• if $m \in (B^{[k]} \setminus B^{[k-1]})_t$ for some integer $k \ge 2$, write $m = x_{i_0} m'$, where $m' \in B^{[k-1]}$
and $i_0 \in [1, n]$ is the smallest possible variable index for which such a decomposition
exists, then $\pi_{F,B}(m) = \pi_{F,B}(x_{i_0} \pi_{F,B}(m'))$.

If $F$ is a graded rewriting family, one can easily verify that $\deg(\pi_{F,B}(m)) \le \deg(m)$ for
$m \in \mathcal{M}_t$. The map $\pi_{F,B}$ extends by linearity to a linear map from $\mathbb{K}[\mathbf{x}]_t$ onto $\langle B \rangle_t$. By
construction, $f = \gamma(f) - \pi_{F,B}(\gamma(f))$ and $\pi_{F,B}(f) = 0$ for all $f \in F_t$. The next theorems show
that, under some natural commutativity condition, the map $\pi_{F,B}$ coincides with the linear
projection from $\mathbb{K}[\mathbf{x}]_t$ onto $\langle B \rangle_t$ along the vector space $\langle F \mid t \rangle$. It leads to the notion of border
basis.
Definition 2.7. Let $B \subseteq \mathcal{M}$ be connected to 1. A family $F \subseteq \mathbb{K}[\mathbf{x}]$ is a border basis for $B$ if
it is a rewriting family for $B$, complete in all degrees, and such that $\mathbb{K}[\mathbf{x}] = \langle B \rangle \oplus (F)$.

An algorithmic way to check that we have a border basis is based on the following result,
that we recall from [21]:

Theorem 2.8. Assume that $B$ is connected to 1 and let $F$ be a rewriting family for $B$, complete
in degree $t \in \mathbb{N}$. Suppose that, for all $m \in \mathcal{M}_{t-2}$,

(2) $\pi_{F,B}(x_i \, \pi_{F,B}(x_j m)) = \pi_{F,B}(x_j \, \pi_{F,B}(x_i m))$ for all $i, j \in [1, n]$.

Then $\pi_{F,B}$ coincides with the linear projection of $\mathbb{K}[\mathbf{x}]_t$ on $\langle B \rangle_t$ along the vector space $\langle F \mid t \rangle$;
that is, $\mathbb{K}[\mathbf{x}]_t = \langle B \rangle_t \oplus \langle F \mid t \rangle$.

In order to have a simple and effective way to test the commutation relations (2), we
introduce now the commutation polynomials.

Definition 2.9. Let $F$ be a rewriting family and $f, f' \in F$. Let $m, m'$ be the smallest degree
monomials for which $m \gamma(f) = m' \gamma(f')$. Then the polynomial $C(f, f') := m f - m' f' =
m' \pi_{F,B}(\gamma(f')) - m \pi_{F,B}(\gamma(f))$ is called the commutation polynomial of $f, f'$.

Definition 2.10. For a rewriting family $F$ with respect to $B$, we denote by $C^{+}(F)$ the set
of polynomials of the form $m f - m' f'$, where $f, f' \in F$ and
$m, m' \in \{0, 1, x_1, \ldots, x_n\}$ satisfy

• either $m \gamma(f) = m' \gamma(f')$,

• or $m \gamma(f) \in B$ and $m' = 0$.

Therefore, $C^{+}(F) \subseteq \langle B^{+} \rangle$ and $C^{+}(F)$ contains all commutation polynomials $C(f, f')$ for
$f, f' \in F$ whose monomial multipliers $m, m'$ are of degree $\le 1$. The next result can be deduced
using Theorem 2.8.

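Theorem 2.11. Let $B \subseteq \mathcal{M}$ be connected to 1 and let $F$ be a rewriting family for $B$, complete
in degree $t$. If for all $c \in C^{+}(F)$ of degree $\le t$, $\pi_{F,B}(c) = 0$, then $\pi_{F,B}$ is the projection of $\mathbb{K}[\mathbf{x}]_t$
on $\langle B \rangle_t$ along $\langle F \mid t \rangle$, i.e. $\mathbb{K}[\mathbf{x}]_t = \langle B \rangle_t \oplus \langle F \mid t \rangle$.

If such a property is satisfied we say that $F$ is a border basis for $B$ in degree $\le t$.

Theorem 2.12 (border basis, [21]). Let $B \subseteq \mathcal{M}$ be connected to 1 and let $F$ be a rewriting
family for $B$, complete in any degree. Assume that $\pi_{F,B}(c) = 0$ for all $c \in C^{+}(F)$. Then $B$ is
a basis of $\mathbb{K}[\mathbf{x}]/(F)$, $\mathbb{K}[\mathbf{x}] = \langle B \rangle \oplus (F)$, and $(F)_t = \langle F \mid t \rangle$ for all $t \in \mathbb{N}$; the set $F$ is a border
basis of the ideal $I = (F)$ with respect to $B$.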

This implies the following characterization of border bases using the commutation property. 

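Corollary 2.13 (border basis, [20]). Let $B \subseteq \mathcal{M}$ be connected to 1 and let $F$ be a rewriting
family for $B$, complete in any degree. If for all $m \in B$ and all indices $i, j \in [1, n]$, we have:

$\pi_{F,B}(x_i \, \pi_{F,B}(x_j m)) = \pi_{F,B}(x_j \, \pi_{F,B}(x_i m))$,

then $B$ is a basis of $\mathbb{K}[\mathbf{x}]/(F)$, $\mathbb{K}[\mathbf{x}] = \langle B \rangle \oplus (F)$, and $(F)_t = \langle F \mid t \rangle$ for all $t \in \mathbb{N}$.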
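
The commutation test of Corollary 2.13 can be tried on a toy rewriting family. The Python sketch below is an illustration under assumptions made here, not the border basis implementation mentioned in the paper: it takes B = {1, x, y, xy} and the family F = {x^2 - 1, y^2 - 1, x^2 y - y, x y^2 - x}, stores for each border monomial its projection on the span of B, and compares the two sides of the commutation relation for every m in B. The names `GOOD_RULES`, `BAD_RULES`, `times_var`, `project` and `commutes` are hypothetical.

```python
# Monomials are exponent tuples; polynomials are dicts {monomial: coefficient}.
B = [(0, 0), (1, 0), (0, 1), (1, 1)]          # 1, x, y, xy

# For each border monomial gamma(f): its projection gamma(f) - f onto <B>.
GOOD_RULES = {(2, 0): {(0, 0): 1},            # x^2   -> 1
              (0, 2): {(0, 0): 1},            # y^2   -> 1
              (2, 1): {(0, 1): 1},            # x^2 y -> y
              (1, 2): {(1, 0): 1}}            # x y^2 -> x

# Same family, except that x^2 y is (inconsistently) rewritten to 1.
BAD_RULES = dict(GOOD_RULES)
BAD_RULES[(2, 1)] = {(0, 0): 1}


def times_var(p, i):
    """Multiply the polynomial p by the variable x_i."""
    return {m[:i] + (m[i] + 1,) + m[i + 1:]: c for m, c in p.items()}


def project(p, rules):
    """Projection on <B> of a polynomial whose support lies in B^+."""
    out = {}
    for m, c in p.items():
        image = {m: 1} if m in B else rules[m]
        for b, cb in image.items():
            out[b] = out.get(b, 0) + c * cb
    return {m: c for m, c in out.items() if c != 0}


def commutes(rules, n=2):
    """Check pi(x_i pi(x_j m)) == pi(x_j pi(x_i m)) for all m in B and all i, j."""
    for m in B:
        for i in range(n):
            for j in range(n):
                lhs = project(times_var(project(times_var({m: 1}, j), rules), i), rules)
                rhs = project(times_var(project(times_var({m: 1}, i), rules), j), rules)
                if lhs != rhs:
                    return False
    return True


print(commutes(GOOD_RULES))   # the family F above passes the test
print(commutes(BAD_RULES))    # the inconsistent family fails it
```

Only monomials of B^+ occur in this check, so the first two cases of Definition 2.6 are enough to evaluate the projection here.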

2.4. Hankel operators and positive linear forms. This section is based on [IB"] . 

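Definition 2.14. For $\Lambda \in \mathbb{R}[\mathbf{x}]^*$, the Hankel operator $H_{\Lambda}$ is the operator from $\mathbb{R}[\mathbf{x}]$ to $\mathbb{R}[\mathbf{x}]^*$
defined by

(3) $H_{\Lambda} : p \in \mathbb{R}[\mathbf{x}] \mapsto p \cdot \Lambda \in \mathbb{R}[\mathbf{x}]^*$.

Definition 2.15. We define the kernel of the Hankel operator:

(4) $\ker H_{\Lambda} = \{ p \in \mathbb{R}[\mathbf{x}] \mid p \cdot \Lambda = 0, \text{ i.e., } \Lambda(pq) = 0 \ \forall q \in \mathbb{R}[\mathbf{x}] \}$.

To analyse the properties of $\Lambda$, we study the quotient algebra $\mathbb{K}[\mathbf{x}]/\ker H_{\Lambda} = \mathcal{A}_{\Lambda}$. A first
result is the following (see e.g. [13]):

Lemma 2.16. The rank of the operator $H_{\Lambda}$ is finite if and only if $\ker H_{\Lambda}$ is a zero-dimensional
ideal, in which case $\dim \mathbb{K}[\mathbf{x}]/\ker H_{\Lambda} = \mathrm{rank}\, H_{\Lambda}$.

For a zero-dimensional ideal $I \subseteq \mathbb{K}[\mathbf{x}]$ with simple zeros $V(I) = \{\zeta_1, \ldots, \zeta_r\} \subseteq \mathbb{K}^n$, we have
$I^{\perp} = \langle \mathbf{1}_{\zeta_1}, \ldots, \mathbf{1}_{\zeta_r} \rangle$ and the ideal $I$ is radical as a consequence of Hilbert's Nullstellensatz.
When $I = \ker H_{\Lambda}$, this yields the following property.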

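Proposition 2.17. Let $\mathbb{K} = \mathbb{C}$ and assume that $\mathrm{rank}\, H_{\Lambda} = r < \infty$. Then, the ideal $\ker H_{\Lambda}$
is radical if and only if

(5) $\Lambda = \sum_{i=1}^{r} \lambda_i \mathbf{1}_{\zeta_i}$ with $\lambda_i \in \mathbb{K} - \{0\}$ and $\zeta_i \in \mathbb{K}^n$ pairwise distinct,

in which case $\ker H_{\Lambda} = I(\zeta_1, \ldots, \zeta_r)$ is the vanishing ideal of the points $\zeta_1, \ldots, \zeta_r$.

As a corollary, we deduce that when $\mathbb{K} = \mathbb{R}$ and $\mathrm{rank}\, H_{\Lambda} = r < \infty$, the ideal $\ker H_{\Lambda}$ is real
radical if and only if the points $\zeta_i$ are in $\mathbb{R}^n$.

Definition 2.18. We say that $\Lambda \in \mathbb{R}[\mathbf{x}]^*$ is positive, which we denote $\Lambda \succcurlyeq 0$, if $\Lambda(p^2) \ge 0$
for all $p \in \mathbb{R}[\mathbf{x}]$. We have that $\Lambda \succcurlyeq 0$ iff $H_{\Lambda} \succcurlyeq 0$. If moreover $\Lambda(1) = 1$, we say that $\Lambda$ is a
probability measure.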

This term is justified by a theorem of Riesz-Haviland [28], which states that a Lebesgue
measure on $\mathbb{R}^n$ is uniquely determined by its value on the polynomials in $\mathbb{R}[\mathbf{x}]$. In particular, if
$\Lambda \succcurlyeq 0$ and $\Lambda(1) = 1$, there exists a unique probability measure $\mu$ such that $\forall p \in \mathbb{R}[\mathbf{x}]$,
$\Lambda(p) = \int p \, d\mu$.

An important property of positive forms is the following: 

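Proposition 2.19. Assume $\mathrm{rank}\, H_{\Lambda} = r < \infty$. Then $\Lambda \succcurlyeq 0$ if and only if $\Lambda$ has a decomposition
(5) with $\lambda_i > 0$ and distinct $\zeta_i \in \mathbb{R}^n$, in which case $V(\ker H_{\Lambda}) = \{\zeta_1, \ldots, \zeta_r\} \subseteq \mathbb{R}^n$.

In particular, it shows that if $\Lambda \succcurlyeq 0$, then $\ker H_{\Lambda}$ is a real radical ideal.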
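
To make the link between a decomposition of the form (5) and the Hankel matrix concrete, here is a Python sketch on a univariate toy example chosen for illustration (it is not taken from the paper): the linear form is the average of the evaluations at -1 and 1, restricted to polynomials of degree at most 2. The helper names `moments`, `hankel`, `rank` and `apply` are hypothetical, and exact rational arithmetic replaces the numerical linear algebra one would use in practice.

```python
from fractions import Fraction


def moments(points, weights, max_deg):
    """Moments Lambda(x^k), k = 0..max_deg, of Lambda = sum_i w_i * (evaluation at z_i)."""
    return [sum(w * z ** k for z, w in zip(points, weights))
            for k in range(max_deg + 1)]


def hankel(m, size):
    """Matrix (Lambda(x^(i+j))) of the truncated Hankel operator in the basis 1, x, ..."""
    return [[m[i + j] for j in range(size)] for i in range(size)]


def rank(M):
    """Rank by exact Gaussian elimination over the rationals."""
    A = [[Fraction(x) for x in row] for row in M]
    r = 0
    for c in range(len(A[0])):
        pivot = next((i for i in range(r, len(A)) if A[i][c] != 0), None)
        if pivot is None:
            continue
        A[r], A[pivot] = A[pivot], A[r]
        for i in range(r + 1, len(A)):
            factor = A[i][c] / A[r][c]
            A[i] = [a - factor * b for a, b in zip(A[i], A[r])]
        r += 1
    return r


def apply(H, p):
    """Matrix-vector product H p."""
    return [sum(h * c for h, c in zip(row, p)) for row in H]


points = [Fraction(-1), Fraction(1)]
weights = [Fraction(1, 2), Fraction(1, 2)]
H = hankel(moments(points, weights, 4), 3)   # rows/columns indexed by 1, x, x^2

print(rank(H))                # number of points in the decomposition
print(apply(H, [-1, 0, 1]))   # x^2 - 1 vanishes at both points, so it is in the kernel
p = [1, 2, 3]                 # p = 1 + 2x + 3x^2
print(sum(a * b for a, b in zip(p, apply(H, p))))   # Lambda(p^2), a non-negative number
```

In this example the rank equals the number of points, and the kernel element x^2 - 1 generates their vanishing ideal, in line with Propositions 2.17 and 2.19.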

3. Main Results 

In this section, we give the main results, which show how the minimum of $f$ and the ideal
defining the points where this minimum is reached can be computed.

Hereafter, we will assume that the minimum $f^*$ of $f$ is reached at a point $x^* \in \mathbb{R}^n$.

Definition 3.1. We define the gradient ideal of $f(\mathbf{x})$:

(6) $I_{grad}(f) = (\nabla f(\mathbf{x})) = \left( \frac{\partial f}{\partial x_1}, \ldots, \frac{\partial f}{\partial x_n} \right)$.

Definition 3.2. We define the minimizer ideal of $f(\mathbf{x})$:

(7) $I_{min}(f) = I(\{ x^* \in \mathbb{R}^n \text{ s.t. } f(x^*) \text{ is minimum} \})$.

By construction, $I_{grad}(f) \subseteq I_{min}(f)$ and $I_{min}(f) \neq (1)$ if the minimum $f^*$ is reached in
$\mathbb{R}^n$. The objective of this section is to describe a method to compute generators of $I_{min}(f)$
from generators of $I_{grad}(f)$. For that purpose, first of all we need to restrict our analysis to
matrices of finite size. For this reason, we consider here truncated Hankel operators, which
will play a central role in the construction of the minimizer ideal of $f$.

Definition 3.3. For a vector space $E \subseteq \mathbb{R}[\mathbf{x}]$, let $E \cdot E := \{ pq \mid p, q \in E \}$. For a linear form
$\Lambda \in \langle E \cdot E \rangle^*$, we define the map $H^E_{\Lambda} : E \to E^*$ by $H^E_{\Lambda}(p) = p \cdot \Lambda$ for $p \in E$. Thus $H^E_{\Lambda}$ is
called a truncated Hankel operator, defined on the subspace $E$.

Definition 3.4. We define the kernel of the truncated Hankel operator:

(8) $\ker H^E_{\Lambda} = \{ p \in E \mid p \cdot \Lambda = 0, \text{ i.e., } \Lambda(pq) = 0 \ \forall q \in E \}$.

When $E = \mathbb{R}[\mathbf{x}]_t$ for $t \in \mathbb{N}$, $H^E_{\Lambda}$ is also denoted $H^t_{\Lambda}$.

Given a subspace $E_0 \subseteq E$, $\Lambda$ induces a linear map on $\langle E_0 \cdot E_0 \rangle$ and we can consider the
induced truncated Hankel operator $H^{E_0}_{\Lambda} : E_0 \to (E_0)^*$ as a restriction of $H^E_{\Lambda}$.

Below we will deal with the following sets and minimums, in order to define our primal-dual 
problems. 

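Definition 3.5. Given a vector space $E \subseteq \mathbb{R}[\mathbf{x}]$ and $G \subseteq \langle E \cdot E \rangle$, we define

(9) $\mathcal{L}_{G,E,\succcurlyeq} := \{ \Lambda \in \langle E \cdot E \rangle^* \mid \Lambda \perp G, \ \Lambda(1) = 1, \ \Lambda(p^2) \ge 0 \ \forall p \in E \}$.

If $E = \mathbb{R}[\mathbf{x}]_t$ and $G' = \langle G \mid 2t \rangle$, we also denote $\mathcal{L}_{G',E,\succcurlyeq}$ by $\mathcal{L}_{G,t,\succcurlyeq}$.

Notice that if $G \subseteq G'$ and $E \subseteq E'$ then $\mathcal{L}_{G',E',\succcurlyeq} \subseteq \mathcal{L}_{G,E,\succcurlyeq}$. When $E$ and $G$ are vector
spaces of finite dimension, $\mathcal{L}_{G,E,\succcurlyeq}$ is the intersection of the closed convex cone of positive
semi-definite quadratic forms on $E \times E$ with a linear space, thus it is a convex closed semi-algebraic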
set. More details on its description will be given in Section 2J We will need the following 
result: 

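Lemma 3.6. For any vector space $E$ and $G = \{0\}$ and $\Lambda, \Lambda' \in \mathcal{L}_{G,E,\succcurlyeq}$, we have:

• $\forall p \in E$, $\Lambda(p^2) = 0$ implies $p \in \ker H^E_{\Lambda}$.

• $\ker H^E_{\Lambda + \Lambda'} = \ker H^E_{\Lambda} \cap \ker H^E_{\Lambda'}$.

Proof. The first point is a consequence of the positivity of $H^E_{\Lambda}$. Indeed, if $\Lambda(p^2) = 0$, then
$\forall q \in E$, $\forall t \in \mathbb{R}$, $\Lambda((p + tq)^2) = t^2 \Lambda(q^2) + 2t \Lambda(pq) \ge 0$. Dividing
by $t$ (for $t > 0$ and for $t < 0$) and letting $t$ go to zero yields $\Lambda(pq) = 0$, thus showing $p \in \ker H^E_{\Lambda}$.

For the second point, the inclusion $\ker H^E_{\Lambda} \cap \ker H^E_{\Lambda'} \subseteq \ker H^E_{\Lambda + \Lambda'}$ is obvious.
Conversely, let $p \in \ker H^E_{\Lambda + \Lambda'}$. In particular, $(\Lambda + \Lambda')(p^2) = 0$, which implies $\Lambda(p^2) =
\Lambda'(p^2) = 0$ (since $\Lambda(p^2), \Lambda'(p^2) \ge 0$) and thus, by the first point, $p \in \ker H^E_{\Lambda} \cap \ker H^E_{\Lambda'}$. □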

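Definition 3.7. Given a vector space $E \subseteq \mathbb{R}[\mathbf{x}]$ and $G \subseteq \langle E \cdot E \rangle$, we define

(10) $\mathcal{S}_{G,E} := \{ p \in \mathbb{R}[\mathbf{x}] \mid p = \sum_{i=1}^{s} h_i^2 + h, \ h_i \in E, \ h \in G \}$.

If $E = \mathbb{R}[\mathbf{x}]_t$ and $G' = \langle G \mid 2t \rangle$, we also denote $\mathcal{S}_{G',E}$ by $\mathcal{S}_{G,t}$.

Notice that if $G \subseteq G'$ and $E \subseteq E'$ then $\mathcal{S}_{G,E} \subseteq \mathcal{S}_{G',E'}$. When $E$ and $G$ are vector spaces
of finite dimension, $\mathcal{S}_{G,E}$ is the projection of the sum of a linear space and the convex cone
of positive quadratic forms on $E^* \times E^*$.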

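Definition 3.8. Let $E$ be a subspace of $\mathbb{R}[\mathbf{x}]$ such that $1 \in E$ and $f \in \langle E \cdot E \rangle$ and let
$G \subseteq \langle E \cdot E \rangle$. We assume that $f$ attains its minimum at some points $x^* \in \mathbb{R}^n$. We define the
following extrema:

• $f^* = \min_{x \in \mathbb{R}^n} f(x)$,

• $f^{\mu}_{G,E} = \inf \{ \Lambda(f) \text{ s.t. } \Lambda \in \mathcal{L}_{G,E,\succcurlyeq} \}$ ${}^{2}$,

• $f^{sos}_{G,E} = \sup \{ \lambda \in \mathbb{R} \text{ s.t. } f - \lambda \in \mathcal{S}_{G,E} \}$.

If $E = \mathbb{R}[\mathbf{x}]_t$ (resp. $E = \mathbb{R}[\mathbf{x}]$) and $G' = \langle G \mid 2t \rangle$ we also denote $f^{\mu}_{G',E}$ by $f^{\mu}_{G,t}$ (resp. $f^{\mu}_{G}$) and
$f^{sos}_{G',E}$ by $f^{sos}_{G,t}$ (resp. $f^{sos}_{G}$).

Remark 3.9. By convention if the sets are empty, $f^{sos}_{G,E} = -\infty$ and $f^{\mu}_{G,E} = +\infty$.
${}^{2}$ The notation $\mu$ stands for measure.
We now analyse the relations between these different extrema.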

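Remark 3.10. Let $E \subseteq E'$ be two subspaces of $\mathbb{R}[\mathbf{x}]$ and $G \subseteq \langle E \cdot E \rangle$, $G' \subseteq \langle E' \cdot E' \rangle$ with
$G \subseteq G'$. Then we directly deduce that

• $f^{\mu}_{G,E} \le f^{\mu}_{G',E'}$,

• $f^{sos}_{G,E} \le f^{sos}_{G',E'}$,

from the fact that $\mathcal{L}_{G',E',\succcurlyeq} \subseteq \mathcal{L}_{G,E,\succcurlyeq}$ and $\mathcal{S}_{G,E} \subseteq \mathcal{S}_{G',E'}$.

If we take $G \subseteq I_{min}(f)$, we have the following relations between the extrema.

Proposition 3.11. Let $G \subseteq I_{min}(f)$. Then $f^{sos}_{G,E} \le f^{\mu}_{G,E} \le f^*$.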

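Proof. We have $f^{sos}_{G,E} \le f^{\mu}_{G,E}$ because if there exists $\lambda \in \mathbb{R}$ such that $f - \lambda \in \mathcal{S}_{G,E}$, i.e.
$f - \lambda = \sum_i h_i^2 + g$ with $h_i \in E$ and $g \in G$, then $\forall \Lambda \in \mathcal{L}_{G,E,\succcurlyeq}$,
$\Lambda(f - \lambda) = \Lambda(f) - \lambda = \sum_i \Lambda(h_i^2) \ge 0$.
We deduce that $\Lambda(f) \ge \lambda$ and we conclude $f^{sos}_{G,E} \le f^{\mu}_{G,E}$. For the second inequality, let $x^*$ be
a point of $\mathbb{R}^n$ such that $f(x^*)$ is the minimum of $f$ and let $\mathbf{1}_{x^*} \in \mathbb{R}[\mathbf{x}]^* : p \mapsto p(x^*)$ be the
evaluation at $x^*$. Then we have $H_{\mathbf{1}_{x^*}} \succcurlyeq 0$, $\mathbf{1}_{x^*}(1) = 1$ and $\mathbf{1}_{x^*}(G) = 0$ since $G \subseteq I_{min}(f)$, so
that $\mathbf{1}_{x^*} \in \mathcal{L}_{G,E,\succcurlyeq}$. We deduce the inequality $f^{\mu}_{G,E} \le f^*$. □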
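
The two inequalities of Proposition 3.11 can be observed on a toy example chosen here for illustration (it is not one of the paper's examples): f = x^4 - 2x^2, whose minimum f* = -1 is reached at x = 1 and x = -1. In the Python sketch below, polynomials are coefficient lists by increasing degree, and the names `poly_mul`, `evaluate` and `Lambda` are hypothetical. The evaluation at x* = 1 is a feasible linear form with value f*, and the identity f + 1 = (x^2 - 1)^2 is a sum-of-squares certificate for the lower bound -1.

```python
def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists (increasing degree)."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def evaluate(p, x):
    """Value of the polynomial p at the point x."""
    return sum(c * x ** k for k, c in enumerate(p))


f = [0, 0, -2, 0, 1]      # f = x^4 - 2 x^2
df = [0, -4, 0, 4]        # f' = 4 x^3 - 4 x, generator of the gradient ideal
x_star, f_star = 1, -1    # a global minimizer and the minimum


def Lambda(p):
    """Evaluation form at x_star, the linear form used in the proof above."""
    return evaluate(p, x_star)


# Feasibility: Lambda(1) = 1, Lambda vanishes on multiples of f', Lambda(p^2) >= 0.
print(Lambda([1]))
print([Lambda(poly_mul([0] * a + [1], df)) for a in range(4)])
print(Lambda(poly_mul([3, -1, 2], [3, -1, 2])))

# Value of the linear form on f: an upper bound for the infimum over feasible forms.
print(Lambda(f))

# Sum-of-squares certificate: f - (-1) equals (x^2 - 1)^2.
h = [-1, 0, 1]
shifted = f[:]
shifted[0] -= f_star
print(poly_mul(h, h) == shifted)
```

In this example the two bounds coincide with f*, so the relaxation is already exact at degree 2; the general statement requires the results that follow.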

Following the relaxation approach proposed in [14], we are now going to consider a hierarchy
of convex optimization problems and show that for such a hierarchy, the minimum $f^*$ is always
reached in a finite number of steps. Let us consider the sequence of spaces

$\cdots \subseteq \mathcal{L}_{G,t+1,\succcurlyeq} \subseteq \mathcal{L}_{G,t,\succcurlyeq} \subseteq \cdots$ and $\cdots \subseteq \mathcal{S}_{G,t} \subseteq \mathcal{S}_{G,t+1} \subseteq \cdots$

for $t \in \mathbb{N}$, $G \subseteq I_{min}(f)$. Using Remark 3.10 and the fact that $(G) = \bigcup_{t \in \mathbb{N}} \langle G \mid t \rangle$, we check that

• the increasing sequence $\cdots f^{\mu}_{G,t} \le f^{\mu}_{G,t+1} \le \cdots \le f^*$, for $t \in \mathbb{N}$, converges to $f^{\mu}_{(G)} \le f^*$,

• the increasing sequence $\cdots f^{sos}_{G,t} \le f^{sos}_{G,t+1} \le \cdots \le f^*$, for $t \in \mathbb{N}$, converges to $f^{sos}_{(G)} \le f^*$.

We are going to show that these limits are attained for some $t \in \mathbb{N}$.

The next result, which is a slight variation of a result in [15] (and also used in [13]), shows
that for a high enough degree, the kernel of some truncated Hankel operators allows us to
compute generators of the real radical of an ideal.

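Proposition 3.12. For $G \subseteq \mathbb{R}[\mathbf{x}]$ with $I_{grad}(f) \subseteq (G)$, there exists $t_0 \in \mathbb{N}$ such that $\forall t \ge t_0$,
$\forall \Lambda \in \mathcal{L}_{G,t,\succcurlyeq}$, $\sqrt[\mathbb{R}]{I_{grad}(f)} \subseteq (\ker H^t_{\Lambda})$.

Proof. Let $g_1, \ldots, g_n$ be generators of $I := I_{grad}(f)$, $d_s := \deg(g_s)$, $d := \max_{s=1,\ldots,n} d_s$ and
$h_1, \ldots, h_k$ be generators of the ideal $J := \sqrt[\mathbb{R}]{I}$. By the Real Nullstellensatz, for $l \in 1, \ldots, k$,
there exist $m_l \in \mathbb{N}$, $m_l \ge 1$, polynomials $u^l_j$ and sums of squares $\sigma_l$ such that
$h_l^{2 m_l} + \sigma_l = \sum_{j=1}^{n} u^l_j g_j$. As $I \subseteq (G)$, there exists $t'_0$ such that $u^l_j g_j \in \langle G \mid t'_0 \rangle$. Set
$t_0 := \max_{l \le k, j \le n}(t'_0, d, \deg(h_l^{2 m_l}), \deg(\sigma_l))$ and let $t \ge t_0$. As $u^l_j g_j \in \langle G \mid t \rangle$,
$u^l_j g_j \in \ker H^t_{\Lambda}$ for all $\Lambda \in \mathcal{L}_{G,t,\succcurlyeq}$.
Hence $h_l^{2 m_l} + \sigma_l \in \ker H^t_{\Lambda}$, which implies that $h_l \in \ker H^t_{\Lambda}$ since $\Lambda \succcurlyeq 0$. □

To compute generators of $I_{min}(f)$, we use the decomposition of $V_{\mathbb{R}}(I_{grad}(f))$ in components
where $f$ has a constant value.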

Lemma 3.13. Let $V_{\mathbb{R}}(I_{grad}(f)) = W_0 \cup W_1 \cup \ldots \cup W_s$ be the decomposition of the variety in
disjoint real subvarieties, such that $f(W_j) = f_j \in \mathbb{R}$ with $f_i < f_j$ $\forall\, 0 \le i < j \le s$. Then there
exist polynomials $p_0, \ldots, p_s \in \mathbb{R}[\mathbf{x}]$ such that $p_i(W_j) = \delta_{ij}$, where $\delta_{ij}$ is the Kronecker delta
function.

Proof. As in [25], we decompose $V_{\mathbb{C}}(I_{grad}(f))$ as a union of complex varieties

$V_{\mathbb{C}}(I_{grad}(f)) = W_{\mathbb{C},0} \cup W_{\mathbb{C},1} \cup \ldots \cup W_{\mathbb{C},s} \cup W_{\mathbb{C},s+1}$

such that $W_{\mathbb{C},i}$, $i = 0, \ldots, s$, have real
points, $f(W_{\mathbb{C},i}) = f_i \in \mathbb{R}$ is constant on $W_{\mathbb{C},i}$, and $W_{\mathbb{C},s+1}$ has no real point. We number
these varieties so that $f_0 < f_1 < \cdots < f_s$ and $f_0 = f^*$. By construction, the varieties
$W_i := W_{\mathbb{C},i} \cap \mathbb{R}^n$ are disjoint, $f$ is constant on $W_i$ and $V_{\mathbb{R}}(I_{grad}(f)) = W_0 \cup W_1 \cup \ldots \cup W_s$.
Let us take $p_i = L_i(f(\mathbf{x}))$ where $L_0, \ldots, L_s$ are the Lagrange interpolation polynomials at the
values $f_0, \ldots, f_s \in \mathbb{R}$. They satisfy $p_i(W_j) = \delta_{ij}$. □

Remark 3.14. By definition the polynomials $p_i$ have the following properties:

• $p_0 + \ldots + p_s = 1$ modulo $\sqrt[\mathbb{R}]{I_{grad}(f)}$.

• $(p_i)^2 = p_i$ modulo $\sqrt[\mathbb{R}]{I_{grad}(f)}$ $\forall i = 0, \ldots, s$.
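
The construction of the polynomials of Lemma 3.13 can be checked numerically on a toy example chosen here (not taken from the paper): f = x^4 - 2x^2, whose real critical points are -1, 1 (critical value -1) and 0 (critical value 0). The Python sketch below builds the Lagrange interpolation polynomials at the critical values and composes them with f; the names `lagrange_basis`, `components` and `p` are hypothetical. Since the real radical of the gradient ideal is the vanishing ideal of the real critical points, the identities of Remark 3.14 are tested by evaluation at those points.

```python
from fractions import Fraction


def lagrange_basis(values):
    """Functions L_0..L_s with L_i(values[j]) = 1 if i == j and 0 otherwise."""
    def make(i):
        def L(y):
            out = Fraction(1)
            for j, vj in enumerate(values):
                if j != i:
                    out *= Fraction(y - vj, values[i] - vj)
            return out
        return L
    return [make(i) for i in range(len(values))]


def f(x):
    return x ** 4 - 2 * x ** 2


# Real critical points of f grouped by critical value: W_0 (minimum) and W_1.
components = [[-1, 1], [0]]
values = [f(W[0]) for W in components]            # f_0 = -1 < f_1 = 0
L = lagrange_basis(values)
p = [lambda x, Li=Li: Li(f(x)) for Li in L]       # p_i = L_i(f(x))

for j, W in enumerate(components):
    for v in W:
        row = [p_i(v) for p_i in p]
        print(v, row, sum(row))                   # Kronecker delta row, with sum 1
```

Each printed row is the indicator of the component containing the point, so the sum of the p_i is 1 and each p_i is idempotent on the real critical points.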

The next result shows that in the sequence of optimization problems that we consider, the
minimum of $f$ is reached from a certain degree.

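Theorem 3.15. For $G \subseteq \mathbb{R}[\mathbf{x}]$ with $I_{grad}(f) \subseteq (G) \subseteq I_{min}(f)$, there exists $t_1 \ge 0$ such that
$\forall t \ge t_1$, $f^{sos}_{G,t} = f^{\mu}_{G,t} = f^*$ and $\forall \Lambda^* \in \mathcal{L}_{G,t,\succcurlyeq}$ with $\Lambda^*(f) = f^{\mu}_{G,t}$, we have
$p_i \in \ker H^t_{\Lambda^*}$ $\forall i = 1, \ldots, s$.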

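Proof. By [M] [Theorem 2.3], there exists $t'_1 \in \mathbb{N}$ such that $\forall t \ge t'_1$, $f^{sos}_{D,t} = f^{\mu}_{D,t} = f^*$ where
$D = \{ \frac{\partial f}{\partial x_1}, \ldots, \frac{\partial f}{\partial x_n} \}$. As $(D) = I_{grad}(f) \subseteq (G) = (g_1, \ldots, g_s)$, there exist $q_{i,j} \in \mathbb{R}[\mathbf{x}]$ such
that $\frac{\partial f}{\partial x_i} = \sum_{j=1}^{s} q_{i,j} g_j$. Let $d = \max\{\deg(q_{i,j}), i = 1, \ldots, n, j = 1, \ldots, s\}$. Then
$\langle D \mid t \rangle \subseteq \langle G \mid t + d \rangle$. By Remark 3.10 and Proposition 3.11, for $t \ge t'_1$,
$f^* = f^{sos}_{D,t} \le f^{sos}_{G,t+d} \le f^{\mu}_{G,t+d} \le f^*$. Thus for $t \ge t'_1 + d$, we have

(11) $f^{sos}_{G,t} = f^{\mu}_{G,t} = f^*$.

Let $J = \sqrt[\mathbb{R}]{I_{grad}(f)}$. By Lemma 3.13 and Remark 3.14 we can write

(12) $f \equiv \sum_{i=0}^{s} f_i p_i$ modulo $J$,

where $f_i = f(W_i) \in \mathbb{R}$ and $f_0 = f(W_0) = f^*$. Then

(13) $f \equiv f^* p_0^2 + \sum_{i=1}^{s} f_i p_i^2 \equiv f^* \left(1 - \sum_{r=1}^{s} p_r^2\right) + \sum_{j=1}^{s} f_j p_j^2 \equiv f^* + \sum_{i=1}^{s} (f_i - f^*) p_i^2$ modulo $J$.

Hence

(14) $f - f^* = \sum_{i=1}^{s} (f_i - f^*) p_i^2 + h$,

with $h \in J$. By Proposition 3.12 there exists $t''_1 \ge t_0$ such that $\Lambda(h) = 0$ for all $\Lambda \in \mathcal{L}_{G,t''_1,\succcurlyeq}$.

Let us take $t_1 = \max\{t'_1 + d, t''_1, \deg(p_1), \ldots, \deg(p_s)\}$, $t \ge t_1$ and $\Lambda^* \in \mathcal{L}_{G,t,\succcurlyeq}$ such that
$\Lambda^*(f) = f^{\mu}_{G,t}$. From Equation (14), we deduce that

(15) $\Lambda^*(f - f^*) = 0 = \sum_{i=1}^{s} (f_i - f^*) \Lambda^*(p_i^2)$.

This implies that $\Lambda^*(p_i^2) = 0$ and $p_i \in \ker H^t_{\Lambda^*}$ for $i = 1, \ldots, s$, since $f_i - f^* > 0$ and $\Lambda^* \succcurlyeq 0$
on $\mathbb{R}[\mathbf{x}]_t$. □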

Example 3.4 in [25] shows that it may not always be possible to write $f - f^*$ as a sum of
squares modulo $I_{grad}(f)$ but, from the previous result, we see that $f - f^*$ is the limit of sums
of squares modulo $I_{grad}(f) \cap \mathbb{R}[x]_t$ for a fixed $t \ge t_1$. Under the assumption that there exists
$x^* \in \mathbb{R}^n$ such that $f(x^*) = f^*$, we can construct $\Lambda^* = \mathbf{1}_{x^*} \in \mathcal{L}_{G,t,\succcurlyeq}$ such that $\Lambda^*(f) = f^*$.
This means that $f^{\mu}_{G,t}$ is reached for $t \ge t_1$.
A direct corollary of the previous theorem is that for any $G \subset \mathbb{R}[x]$ with $I_{grad}(f) \subset (G) \subset I_{min}(f)$, we have

$f^{sos}_{I_{grad}(f)} = f^{sos}_{(G)} = f^{\mu}_{I_{grad}(f)} = f^{\mu}_{(G)} = f^*$.

As for the construction of generators of the real radical $\sqrt{I_{grad}(f)}$ (Proposition 3.12), we
can construct generators of $I_{min}(f)$ from the kernel of a truncated Hankel operator associated
to any linear form which minimizes $f$:

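Theorem 3.16. For $G \subset \mathbb{R}[x]$ with $I_{grad}(f) \subset (G) \subset I_{min}(f)$, there exists $t_2 \in \mathbb{N}$ such that
$\forall t \ge t_2$, for $\Lambda^* \in \mathcal{L}_{G,t,\succcurlyeq}$ with $\Lambda^*(f) = f^{\mu}_{G,t}$, we have $I_{min}(f) \subset (\ker H^{t}_{\Lambda^*})$.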

Proof. Let $I = I_{grad}(f)$ and $J = \sqrt{I}$. We consider again the above decomposition $V_{\mathbb{R}}(I) = W_0 \cup W_1 \cup \ldots \cup W_s$ with $f_0 = f(W_0) < f_1 = f(W_1) < \cdots < f_s = f(W_s)$. We denote by
$h_1, \ldots, h_k$ a family of generators of the ideal $I_{min}(f) = I(W_0)$ and $d = \max\{\deg(h_1), \ldots, \deg(h_k)\}$.

Let us fix $1 \le j \le k$ and show that $h_j \in \ker H^{t}_{\Lambda^*}$ for $t$ sufficiently large. We know that
$p_0 h_j^2|_{W_0} = 0$ and that for $i = 1, \ldots, s$, $p_0 h_j^2|_{W_i} = 0$, which implies that $p_0 h_j^2 \in J := \sqrt{I_{grad}(f)}$.
By Theorem 3.12, there exists $t'_{2,j}$ such that $\Lambda(p_0 h_j^2) = 0$ for all $\Lambda \in \mathcal{L}_{G,t,\succcurlyeq}$ with $t \ge t'_{2,j}$.

By Theorem 3.15, $p_i \in \ker H^{t}_{\Lambda^*}$ for $t \ge t_1$. By Remark 3.14, $p_0 + \cdots + p_s = 1$ modulo $J$.
By Theorem 3.12, there exists $t''_2$ such that $\Lambda(p_0 + \cdots + p_s - 1) = 0$ for all $\Lambda \in \mathcal{L}_{G,t,\succcurlyeq}$ with $t \ge t''_2$.

Let us take $t_2 := \max\{t_1 + 2d,\ t''_2 + 2d,\ t'_{2,1}, \ldots, t'_{2,k}\}$ and $t \ge t_2$. Then for $\Lambda^* \in \mathcal{L}_{G,t,\succcurlyeq}$ with
$\Lambda^*(f) = f^{\mu}_{G,t}$, we have

$\Lambda^*(h_j^2) = \Lambda^*(h_j^2\, p_0) + \Lambda^*(h_j^2\, p_1) + \cdots + \Lambda^*(h_j^2\, p_s) = 0$.

Hence, $h_j \in \ker H^{t}_{\Lambda^*}$ for $j = 1, \ldots, k$ and $I_{min}(f) \subset (\ker H^{t}_{\Lambda^*})$. □

We now introduce the notion of generic linear form for $f \in \mathbb{R}[x]$. Such a linear form will
allow us to compute $I_{min}(f)$, as we will see.

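Proposition 3.17. For $\Lambda^* \in \mathcal{L}_{G,E,\succcurlyeq}$ and $p \in \mathbb{R}[x]$, the following assertions are equivalent:

(i) $\operatorname{rank} H^{E}_{\Lambda^*} = \max_{\Lambda \in \mathcal{L}_{G,E,\succcurlyeq},\ \Lambda(p) = p^{\mu}_{G,E}} \operatorname{rank} H^{E}_{\Lambda}$.

(ii) $\forall \Lambda \in \mathcal{L}_{G,E,\succcurlyeq}$ such that $\Lambda(p) = p^{\mu}_{G,E}$, $\ker H^{E}_{\Lambda^*} \subset \ker H^{E}_{\Lambda}$.

(iii) $\operatorname{rank} H^{E_0}_{\Lambda^*} = \max_{\Lambda \in \mathcal{L}_{G,E,\succcurlyeq},\ \Lambda(p) = p^{\mu}_{G,E}} \operatorname{rank} H^{E_0}_{\Lambda}$ for any subspace $E_0 \subset E$.

We say that $\Lambda^* \in \mathcal{L}_{G,E,\succcurlyeq}$ is generic for $p$ if it satisfies one of the equivalent conditions (i)-(iii).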
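Proof. (i) $\Rightarrow$ (ii): Note that $\frac{1}{2}(\Lambda + \Lambda^*) \in \mathcal{L}_{G,E,\succcurlyeq}$ and $\frac{1}{2}(\Lambda + \Lambda^*)(p) = p^{\mu}_{G,E}$ and $\ker H^{E}_{\frac{1}{2}(\Lambda + \Lambda^*)} = \ker H^{E}_{\Lambda} \cap \ker H^{E}_{\Lambda^*}$ (using Lemma 3.6). Hence, $\operatorname{rank} H^{E}_{\frac{1}{2}(\Lambda + \Lambda^*)} \ge \operatorname{rank} H^{E}_{\Lambda^*}$ and thus equality
holds. This implies that $\ker H^{E}_{\frac{1}{2}(\Lambda + \Lambda^*)} = \ker H^{E}_{\Lambda^*}$, which is thus contained in $\ker H^{E}_{\Lambda}$.

(ii) $\Rightarrow$ (iii): Given $E_0 \subset E$, we show that $\ker H^{E_0}_{\Lambda^*} \subset \ker H^{E_0}_{\Lambda}$. By Lemma 3.6, we have
$\ker H^{E_0}_{\Lambda^*} \subset \ker H^{E}_{\Lambda^*}$ and, by the above, we have $\ker H^{E}_{\Lambda^*} \subset \ker H^{E}_{\Lambda}$.

(iii) $\Rightarrow$ (i): This implication is obvious. □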

The next result, which refines Theorem 3.16, shows that only elements of $I_{min}(f)$ are involved
in the kernel of a truncated Hankel operator associated to a generic linear form for $f$.

Theorem 3.18. Let $E, G$ be as in Definition 3.8 with $G \subset I_{min}(f)$. If $\Lambda^* \in \mathcal{L}_{G,E,\succcurlyeq}$ is generic
for $f$ and such that $\Lambda^*(f) = f^*$, then $\ker H^{E}_{\Lambda^*} \subset I_{min}(f)$.

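Proof. Let $x^* \in \mathbb{R}^n$ be such that $f(x^*) = f^*$ is minimum. Let $\mathbf{1}_{x^*}$ denote the evaluation
at $x^*$ restricted to $\langle E \cdot E \rangle$ and let $h \in \ker H^{E}_{\Lambda^*}$. Our objective is to show that $h(x^*) = 0$.
Suppose for contradiction that $h(x^*) \ne 0$. We know that $\mathbf{1}_{x^*} \in \mathcal{L}_{G,E,\succcurlyeq}$ since $G \subset I_{min}(f)$
and $\mathbf{1}_{x^*}(f) = f(x^*) = f^*$. We define $\tilde{\Lambda} = \frac{1}{2}(\Lambda^* + \mathbf{1}_{x^*})$. By definition $\tilde{\Lambda} \in \mathcal{L}_{G,E,\succcurlyeq}$ and
$\tilde{\Lambda}(f) = \frac{1}{2}(\Lambda^*(f) + \mathbf{1}_{x^*}(f)) = \frac{1}{2}(\Lambda^*(f) + f(x^*)) = f^*$. As $h \in \ker H^{E}_{\Lambda^*}$,

$\tilde{\Lambda}(h^2) = \frac{1}{2}(\Lambda^*(h^2) + \mathbf{1}_{x^*}(h^2)) = \frac{1}{2} h^2(x^*) \ne 0$,

thus $h \in \ker H^{E}_{\Lambda^*} \setminus \ker H^{E}_{\tilde{\Lambda}}$ and, by the maximality of the rank of $H^{E}_{\Lambda^*}$, $\ker H^{E}_{\tilde{\Lambda}} \not\subset \ker H^{E}_{\Lambda^*}$.
Hence there exists $\tilde{h} \in \ker H^{E}_{\tilde{\Lambda}} \setminus \ker H^{E}_{\Lambda^*}$. Then $0 = H^{E}_{\tilde{\Lambda}}(\tilde{h}) = \frac{1}{2}(H^{E}_{\Lambda^*}(\tilde{h}) + H^{E}_{\mathbf{1}_{x^*}}(\tilde{h})) = \frac{1}{2}(H^{E}_{\Lambda^*}(\tilde{h}) + \tilde{h}(x^*) \cdot \mathbf{1}_{x^*})$. As $H^{E}_{\Lambda^*}(\tilde{h}) \ne 0$, this implies $\tilde{h}(x^*) \ne 0$. On the other hand,

$0 = H^{E}_{\tilde{\Lambda}}(\tilde{h})(h) = \tilde{\Lambda}(\tilde{h} h) = \frac{1}{2}(\Lambda^*(\tilde{h} h) + \tilde{h}(x^*) h(x^*)) = \frac{1}{2}(H^{E}_{\Lambda^*}(h)(\tilde{h}) + \tilde{h}(x^*) h(x^*))$.

As $h \in \ker H^{E}_{\Lambda^*}$, then

$0 = H^{E}_{\tilde{\Lambda}}(\tilde{h})(h) = \frac{1}{2}\, \tilde{h}(x^*)\, h(x^*)$.

As $\tilde{h}(x^*) \ne 0$ and we have supposed that $h(x^*) \ne 0$, this yields a contradiction. □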

The last result of this section shows that a generic linear form for $f$ yields the generators
of the minimizer ideal $I_{min}(f)$ in high enough degree.

Theorem 3.19. For $G \subset \mathbb{R}[x]$ with $I_{grad}(f) \subset (G) \subset I_{min}(f)$, there exists $t_2 \in \mathbb{N}$ (defined in
Theorem 3.16) such that $\forall t \ge t_2$, for $\Lambda^* \in \mathcal{L}_{G,t,\succcurlyeq}$ generic for $f$, we have $\Lambda^*(f) = f^*$
and $(\ker H^{t}_{\Lambda^*}) = I_{min}(f)$.

Proof. We obtain the result as a consequence of Theorem 3.15, Theorem 3.16 and Theorem 3.18. □

As a consequence, in a finite number of steps, the sequence of optimization problems that
we consider gives the minimum of $f$ and the generators of $I_{min}(f)$.

4. Zero-dimensional Case 

In this section, we describe a criterion to detect when the kernel of a truncated Hankel
operator associated to a generic linear form for $f$ yields the generators of the minimizer ideal.
It is based on a flat extension property [18] and applies to global polynomial optimization
problems where the minimizer ideal $I_{min}(f)$ is zero-dimensional.

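Definition 4.1. Given vector subspaces $E_0 \subset E \subset \mathbb{R}[x]$ and $\Lambda \in \langle E \cdot E \rangle^*$, $H^{E}_{\Lambda}$ is said to be
a flat extension of its restriction $H^{E_0}_{\Lambda}$ if $\operatorname{rank} H^{E}_{\Lambda} = \operatorname{rank} H^{E_0}_{\Lambda}$.

We recall here a result from [18], which gives a rank condition for the existence of a flat
extension of a truncated Hankel operator.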

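Theorem 4.2. Consider a monomial set $B \subset \mathcal{M}$ connected to 1 and a linear function $\Lambda$
defined on $\langle B^+ \cdot B^+ \rangle$. Let $E = \langle B \rangle$ and $E^+ = \langle B^+ \rangle$. Assume that $\operatorname{rank} H^{E^+}_{\Lambda} = \operatorname{rank} H^{E}_{\Lambda} = |B|$. Then there exists a (unique) linear form $\tilde{\Lambda} \in \mathbb{R}[x]^*$ which extends $\Lambda$, i.e., $\tilde{\Lambda}(p) = \Lambda(p)$ for
all $p \in \langle B^+ \cdot B^+ \rangle$, satisfying $\operatorname{rank} H_{\tilde{\Lambda}} = \operatorname{rank} H^{E}_{\Lambda}$. Moreover, we have $\ker H_{\tilde{\Lambda}} = (\ker H^{E^+}_{\Lambda})$.

In other words, the condition $\operatorname{rank} H^{E^+}_{\Lambda} = \operatorname{rank} H^{E}_{\Lambda} = |B|$ implies that the truncated
Hankel operator $H^{E^+}_{\Lambda}$ has a (unique) flat extension to a (full) Hankel operator $H_{\tilde{\Lambda}}$.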
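
The rank condition of Theorem 4.2 can be tested directly on a small instance. The following is a minimal sketch, assuming a univariate linear form $\Lambda = \frac{1}{2}(\mathbf{1}_{-1} + \mathbf{1}_{1})$ chosen only for illustration, with $B = \{1, x\}$ and $B^+ = \{1, x, x^2\}$. The helpers `rank`, `hankel` and `moment` are hypothetical names, not part of any software used in this paper.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a rational matrix by Gaussian elimination (exact arithmetic)."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0
    for c in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][c] / m[r][c]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r

def hankel(moment, degrees):
    """Truncated Hankel matrix (Lambda(x^(a+b))) for the monomials x^a, a in degrees."""
    return [[moment(a + b) for b in degrees] for a in degrees]

def moment(k):
    """Moments of the assumed form Lambda = (1_{-1} + 1_{1}) / 2."""
    return Fraction((-1) ** k + 1, 2)

B, B_plus = [0, 1], [0, 1, 2]     # exponents of B = {1, x} and B+ = {1, x, x^2}
rank_B = rank(hankel(moment, B))
rank_B_plus = rank(hankel(moment, B_plus))
print(rank_B, rank_B_plus, len(B))
```

In this instance the two ranks agree with $|B|$, and the kernel of the larger matrix is spanned by $x^2 - 1$, whose roots are the two evaluation points.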

Proposition 4.3. Let $E, G$ be as in Definition 3.8 with $G \subset I_{min}(f)$. If $\Lambda^* \in \mathcal{L}_{G,E,\succcurlyeq}$
coincides with a probability measure $\mu$ on $\langle E \cdot E \rangle$ and satisfies $\Lambda^*(f) = f^{\mu}_{G,E}$, then $\Lambda^*(f) = f^{\mu}_{G,E} = f^*$.

Proof. By Proposition 3.11, $f^{\mu}_{G,E} \le f^*$. Conversely, as $f(x) \ge f^*$ for all $x \in \mathbb{R}^n$, we have

$\Lambda^*(f) = \int f \, d\mu \ge \int f^* \, d\mu = f^*$. □
Proposition 4.4. If there exists $\Lambda \in \mathcal{L}_{G,E,\succcurlyeq}$ with $\ker H^{E}_{\Lambda} = \{0\}$, then $f^{sos}_{G,E} = f^{\mu}_{G,E}$.

Proof. If $\ker H^{E}_{\Lambda} = \{0\}$, then $\Lambda \succ 0$, i.e., $H^{E}_{\Lambda} \succ 0$. Hence by Slater's Theorem of [2] we
have strong duality, so that $f^{sos}_{G,E} = f^{\mu}_{G,E}$. □

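Theorem 4.5. Let $B$ be a monomial set connected to 1, $E = \langle B^+ \rangle$ and $G \subset \langle B^+ \cdot B^+ \rangle \cap I_{min}(f)$. Let $\Lambda^* \in \mathcal{L}_{G,E,\succcurlyeq}$ be such that $\Lambda^*$ is generic for $f$ and satisfies the flat extension
property: $\operatorname{rank} H^{B^+}_{\Lambda^*} = \operatorname{rank} H^{B}_{\Lambda^*} = |B|$. Then there is no duality gap, $f^* = f^{\mu}_{G,E} = f^{sos}_{G,E}$, and
$(\ker H^{B^+}_{\Lambda^*}) = I_{min}(f)$.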

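Proof. As $\operatorname{rank} H^{B^+}_{\Lambda^*} = \operatorname{rank} H^{B}_{\Lambda^*} = |B|$, Theorem 4.2 implies that there exists a (unique)
linear function $\tilde{\Lambda}^* \in \mathbb{R}[x]^*$ which extends $\Lambda^*$. As $\operatorname{rank} H_{\tilde{\Lambda}^*} = \operatorname{rank} H^{B}_{\Lambda^*} = |B|$ and $\ker H_{\tilde{\Lambda}^*} = (\ker H^{B^+}_{\Lambda^*})$, any polynomial $p \in \mathbb{R}[x]$ can be reduced modulo $\ker H_{\tilde{\Lambda}^*}$ to a polynomial $b \in \langle B \rangle$
so that $p - b \in \ker H_{\tilde{\Lambda}^*}$. Then $\tilde{\Lambda}^*(p^2) = \tilde{\Lambda}^*(b^2) = \Lambda^*(b^2) \ge 0$ since $\Lambda^* \in \mathcal{L}_{G,E,\succcurlyeq}$. This implies
that $\tilde{\Lambda}^* \succcurlyeq 0$. By Proposition 2.19, $\tilde{\Lambda}^*$ has a decomposition $\tilde{\Lambda}^* = \sum_{i=1}^{r} \lambda_i \mathbf{1}_{\zeta_i}$ with $\lambda_i > 0$ and $\zeta_i \in \mathbb{R}^n$.

As $\tilde{\Lambda}^*(1) = \Lambda^*(1) = 1$, $\tilde{\Lambda}^*$ is a probability measure. By Proposition 4.3, $\Lambda^*(f) = \tilde{\Lambda}^*(f) = f^{\mu}_{G,E} = f^*$.

As $\tilde{\Lambda}^* = \sum_{i=1}^{r} \lambda_i \mathbf{1}_{\zeta_i}$ with $\lambda_i > 0$ and $\zeta_i \in \mathbb{R}^n$, and as $\tilde{\Lambda}^*(1) = \sum_{i=1}^{r} \lambda_i = 1$ and
$\tilde{\Lambda}^*(f) = \sum_{i=1}^{r} \lambda_i f(\zeta_i) = f^*$, we deduce that $f(\zeta_i) = f^*$ for $i = 1, \ldots, r$, so that $\{\zeta_1, \ldots, \zeta_r\} \subset V(I_{min}(f))$.

By Proposition 2.17, Theorem 4.2 and Theorem 3.18, we have

$\ker H_{\tilde{\Lambda}^*} = I(\zeta_1, \ldots, \zeta_r) = (\ker H^{B^+}_{\Lambda^*}) \subset I_{min}(f)$.

We deduce that $\{\zeta_1, \ldots, \zeta_r\} = V(I_{min}(f))$, so that $(\ker H^{B^+}_{\Lambda^*}) = I_{min}(f)$.

As we have the flat extension condition, $\ker H^{B}_{\Lambda^*} = \{0\}$ and by Proposition 4.4 there is no
duality gap: $f^* = f^{\mu}_{G,E} = f^{sos}_{G,E}$. □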

Notice that if the hypotheses of this theorem are satisfied, then necessarily $I_{min}(f)$ is
zero-dimensional.

5. Minimizer Border Basis Algorithm

In this section we describe the algorithm to compute the global minimum of a polynomial,
assuming $f^*$ is reached in $\mathbb{R}^n$ and $I_{min}(f)$ is zero-dimensional. It can be seen as a type of
border basis algorithm, in which we insert an additional step in the main loop. It is closely connected
to the real radical border basis algorithm presented in [13], but instead of "minimizing zero"
to generate new elements in the real radical, we minimize $f$ to compute generators of the
minimizer ideal $I_{min}(f)$.

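5.1. Description. The convex optimization problems that we consider are the following:
Algorithm 5.1: Optimal Linear Form
Input: $f \in \mathbb{R}[x]$, $M = (x^{\alpha})_{\alpha \in A}$ a monomial set containing 1 with
$f = \sum_{\alpha \in A+A} f_{\alpha} x^{\alpha} \in \langle M \cdot M \rangle$, $G \subset \mathbb{R}[x]$.
Output: the minimum $f^*_{G,M}$ of $\sum_{\alpha \in A+A} \lambda_{\alpha} f_{\alpha}$ subject to:

- $H^{M}_{\Lambda^*} = (h_{\alpha,\beta})_{\alpha,\beta \in A} \succcurlyeq 0$,

- $H^{M}_{\Lambda^*}$ satisfies the Hankel constraints $h_{0,0} = 1$, and $h_{\alpha,\beta} = h_{\alpha',\beta'} = \lambda_{\alpha+\beta}$
if $\alpha + \beta = \alpha' + \beta'$,

- $\Lambda^*(g) = \sum_{\alpha \in A+A} g_{\alpha} \lambda_{\alpha} = 0$ for all $g = \sum_{\alpha \in A+A} g_{\alpha} x^{\alpha} \in G \cap \langle M \cdot M \rangle$,

and $\Lambda^* \in \langle M \cdot M \rangle^*$ represented by the vector $[\lambda_{\alpha}]_{\alpha \in A+A}$.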
This optimization problem is a Semi-Definite Programming problem, corresponding to the
optimization of a linear functional on a linear subspace of the cone of Positive Semi-Definite
matrices. It is a convex optimization problem, which can be solved efficiently by SDP solvers.
If an Interior Point Method is used, the solution $\Lambda^*$ is in the interior of a face on which the
minimum $\Lambda^*(f)$ is reached, so that $\Lambda^*$ is generic for $f$. This is the case for tools such as csdp
or sdpa, which we will use in the experiments.
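
To make the data of this optimization problem concrete, the following is a minimal sketch of how the vector $[\lambda_{\alpha}]$ determines both the Hankel matrix and the objective value. It does not solve the SDP: it only builds one feasible candidate, the evaluation at a point, assumed here to be $(1, 1)$ for illustration, and applies it to the Motzkin polynomial of Example 6.1. The function names are hypothetical.

```python
from itertools import product

def monomials(n_vars, max_degree):
    """Exponent tuples of all monomials of total degree <= max_degree."""
    return [e for e in product(range(max_degree + 1), repeat=n_vars)
            if sum(e) <= max_degree]

def hankel_from_moments(exponents, lam):
    """Matrix (h_{a,b}) with h_{a,b} = lam[a + b]; Hankel constraints hold by construction."""
    add = lambda a, b: tuple(x + y for x, y in zip(a, b))
    return [[lam[add(a, b)] for b in exponents] for a in exponents]

def apply_form(lam, poly):
    """Lambda(f) = sum of lam[alpha] * f_alpha for f given as {exponent: coefficient}."""
    return sum(coeff * lam[alpha] for alpha, coeff in poly.items())

# Motzkin polynomial of Example 6.1, as {exponent tuple: coefficient}.
motzkin = {(0, 0): 1, (4, 2): 1, (2, 4): 1, (2, 2): -3}

A = monomials(2, 3)                     # M = monomials of degree <= 3
point = (1, 1)                          # assumed evaluation point
lam = {e: point[0] ** e[0] * point[1] ** e[1] for e in monomials(2, 6)}
H = hankel_from_moments(A, lam)
print(len(H), lam[(0, 0)], apply_form(lam, motzkin))
```

An SDP solver would search over all vectors $[\lambda_{\alpha}]$ for which the matrix built this way is positive semidefinite and the linear constraints coming from $G$ hold.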

In the case where $M = B$ is a monomial set connected to 1, $F$ is a complete rewriting
family for $B$ in degree $\le 2t$, and $G = \{b - \pi_{F,B}(b);\ b \in B \cdot B\}$, we will also denote
Optimal Linear Form$(f, B_t, \pi_{F,B})$ := Optimal Linear Form$(f, B_t, G)$.

Algorithm 5.2: Minimizer Ideal of $f$
Input: A real polynomial function $f$ with $I_{min}(f) \ne (1)$ and zero-dimensional.
$F := \{\frac{\partial f}{\partial x_1}, \ldots, \frac{\partial f}{\partial x_n}\}$; $t := \lceil \deg(f)/2 \rceil$; $B$ := set of monomials of degree $\le t$;
$\bar{f} := -\infty$; stop := false;
While not stop

(1) Compute the commuting relations for $F$ with respect to $B$ in degree $\le 2t$.

(2) Reduce them by the existing relations $F$.

(3) Add the non-zero reduced relations to $F$ and update $B$.

(4) Let $[f^*_{F,B_t}, \Lambda^*]$ := Optimal Linear Form$(f, B_t, \pi_{F,B_t})$

(a) If there is a duality gap then go to (1) with $t := t + 1$.

(b) If $\bar{f} \ne f^*_{F,B_t}$ then $\bar{f} := f^*_{F,B_t}$; go to (1) with $t := t + 1$.

(c) Compute the border basis $F'$ of $F + \ker H^{B_{t-1}}_{\Lambda^*}$ and the basis $B'$ in degree
$\le 2(t-1)$.

(d) If there exists $b' \in B'$ with $\deg(b') \ge t-1$ then go to (1) with $t := t + 1$.

(e) Let $[f^*_{F',B'}, \Lambda']$ := Optimal Linear Form$(f, B', \pi_{F',B'})$.

(f) If $f^*_{F',B'} = \bar{f}$ and $\ker H^{B'}_{\Lambda'} = \{0\}$ then stop := true, $F := F'$, $B := B'$, $\bar{f} := f^*_{F,B_t}$;
else go to (1) with $t := t + 1$.

Output: A basis $B$ of $\mathcal{A} = \mathbb{R}[x]/I_{min}(f)$, a border basis $F$ of $I_{min}(f)$ for $B$ and the
minimum $\bar{f}$.


5.2. Correctness of the algorithm. In this subsection, we analyse the correctness of the 
algorithm. 

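Lemma 5.1. Let $t \in \mathbb{N}$, $B \subset \mathbb{R}[x]_{2t}$ be a monomial set connected to 1, $F \subset \mathbb{R}[x]$ be a
border basis for $B$ in degree $\le 2t$ and $G = \langle B_t \cdot B_t \rangle \cap \langle F \,|\, 2t \rangle$. Let $E \subset \mathbb{R}[x]_t$ be a vector
space containing $B^+ \cap \mathbb{R}[x]_t$ and $G' = \langle E \cdot E \rangle \cap \langle F \,|\, 2t \rangle$. For all $\Lambda \in \mathcal{L}_{G,B_t,\succcurlyeq}$, there exists
a unique $\tilde{\Lambda} \in \mathcal{L}_{G',E,\succcurlyeq}$ which extends $\Lambda$. Moreover, $\tilde{\Lambda}$ satisfies $\operatorname{rank} H^{E}_{\tilde{\Lambda}} = \operatorname{rank} H^{B_t}_{\Lambda}$ and
$\ker H^{E}_{\tilde{\Lambda}} = \ker H^{B_t}_{\Lambda} + \langle F \,|\, t \rangle \cap E$.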

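Proof. Suppose that $F \subset \mathbb{R}[x]$ is a border basis for $B$ in degree $\le 2t$, that is, all boundary
polynomials of $C^+(F_{2t})$ reduce to $0$ by $F_{2t}$. Then by Theorem 2.11, we have $\mathbb{R}[x]_{2t} = \langle B \rangle_{2t} \oplus \langle F \,|\, 2t \rangle$ and $\langle E \cdot E \rangle = \langle B \rangle_{2t} \oplus G'$, $\langle B_t \cdot B_t \rangle = \langle B_{2t} \cap B_t \cdot B_t \rangle \oplus G$. Thus for all
$\Lambda \in \mathcal{L}_{G,B_t,\succcurlyeq}$, there exists a unique $\tilde{\Lambda} \in \langle E \cdot E \rangle^*$ such that $\tilde{\Lambda}(G') = 0$ and $\tilde{\Lambda}(b) = \Lambda(b)$ for all
$b \in B_t \cdot B_t$.

As $E \cdot (E \cap \langle F \,|\, t \rangle) \subset \langle E \cdot E \rangle \cap \langle F \,|\, 2t \rangle = G'$, we have $\tilde{\Lambda}(E \cdot (\langle F \,|\, t \rangle \cap E)) = 0$, so that

(16) $\langle F \,|\, t \rangle \cap E \subset \ker H^{E}_{\tilde{\Lambda}}$.

For any element $b \in \ker H^{B_t}_{\Lambda}$ we have $\forall b' \in B_t$, $\tilde{\Lambda}(b b') = \Lambda(b b') = 0$. As $\tilde{\Lambda}(B_t \cdot (\langle F \,|\, t \rangle \cap E)) = 0$
and $E = \langle B_t \rangle \oplus (\langle F \,|\, t \rangle \cap E)$, for any element $e \in E$, $\tilde{\Lambda}(b e) = 0$. This proves that

(17) $\ker H^{B_t}_{\Lambda} \subset \ker H^{E}_{\tilde{\Lambda}}$.

Conversely, as $E = \langle B_t \rangle \oplus (\langle F \,|\, t \rangle \cap E)$, any element of $E$ can be reduced modulo $\langle F \,|\, t \rangle \cap E$
to an element of $\langle B_t \rangle$, which shows that

(18) $\ker H^{E}_{\tilde{\Lambda}} \subset \ker H^{B_t}_{\Lambda} + \langle F \,|\, t \rangle \cap E$.

From the inclusions (16), (17) and (18), we deduce that $\ker H^{E}_{\tilde{\Lambda}} = \ker H^{B_t}_{\Lambda} + (\langle F \,|\, t \rangle \cap E)$
and that $\operatorname{rank} H^{E}_{\tilde{\Lambda}} = \operatorname{rank} H^{B_t}_{\Lambda}$.

As $\Lambda \succcurlyeq 0$, by projection along $(\langle F \,|\, t \rangle \cap E) \subset \ker H^{E}_{\tilde{\Lambda}}$, $\forall p \in E$ there exists $b \in \langle B_t \rangle$ such
that $\tilde{\Lambda}(p^2) = \Lambda(b^2) \ge 0$. Thus $\tilde{\Lambda} \succcurlyeq 0$, which ends the proof of this lemma. □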

Lemma 5.2. If the algorithm terminates, then $F'$ is a border basis of $I_{min}(f)$ for $B'$.

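Proof. If the algorithm stops, this can happen only in step (4) in some degree $t + 1$ such that

• $[f^*_{F,B_{t+1}}, \Lambda^*]$ := Optimal Linear Form$(f, B_{t+1}, F)$,

• $F'$ is a border basis of $F + \ker H^{B_t}_{\Lambda^*}$ for $B'$ in degree $\le 2t$,

• the monomials in $B'$ are of degree $< t$ so that $B'_t = B'$,

• $[f^*_{F',B'}, \Lambda']$ := Optimal Linear Form$(f, B', F')$ with $\bar{f} = \Lambda'(f) = f^*_{F',B'} = \Lambda^*(f) = f^*_{F,B_t}$ and $\ker H^{B'}_{\Lambda'} = \{0\}$.

Then $\langle B'^+ \rangle = \langle B' \rangle \oplus \langle F' \rangle$. By Lemma 5.1 with $E = \langle B'^+ \rangle$, the linear form $\Lambda' \in \mathcal{L}_{F',B',\succcurlyeq}$
can be extended to a linear form $\Lambda'' \in \mathcal{L}_{F'',B'^+,\succcurlyeq}$ with $F'' = \langle F' \,|\, 2t \rangle \cap \langle B'^+ \cdot B'^+ \rangle$ such that
$\ker H^{B'^+}_{\Lambda''} = \langle F' \,|\, t \rangle \cap \langle B'^+ \rangle = \langle F' \rangle$. As $\ker H^{B'}_{\Lambda'} = \{0\}$ and $\langle B'^+ \rangle = \langle B' \rangle \oplus \langle F' \rangle$, we have
$\operatorname{rank} H^{B'^+}_{\Lambda''} = \operatorname{rank} H^{B'}_{\Lambda'} = |B'|$ and the flat extension theorem (Theorem 4.2) applies: $\Lambda''$ is
the restriction to $\langle B'^+ \cdot B'^+ \rangle$ of a positive linear form

$\tilde{\Lambda}' = \sum_{i=1}^{r} \lambda_i \mathbf{1}_{\zeta_i}$

with $r = |B'|$, $\zeta_i \in \mathbb{R}^n$ distinct, $\lambda_i > 0$ and $\sum_{i=1}^{r} \lambda_i = 1$. Then $\Lambda'(f) = \sum_{i=1}^{r} \lambda_i f(\zeta_i) \ge f^*$.
By hypothesis, $\Lambda'(f) = f^*_{F,B_t} \le f^*$ since $F \subset I_{grad}(f) \subset I_{min}(f)$. We deduce that $\Lambda'(f) = \Lambda^*(f) = f^*$, that $(F_{2t} + \ker H^{B_t}_{\Lambda^*}) = (F') \subset I_{min}(f)$ by Theorem 3.18 and that $(F') = I_{min}(f)$
by Theorem 4.5. Therefore $F'$ is a border basis of $I_{min}(f)$ for $B'$. □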

Proposition 5.3. Assume that $I_{min}(f)$ is zero-dimensional. Then the algorithm terminates.
It outputs a border basis $F$ for $B$ connected to 1, such that $\mathbb{R}[x] = \langle B \rangle \oplus (F)$ and $(F) = I_{min}(f)$.

Proof. First, we are going to prove by contradiction that when $I_{min}(f)$ is zero-dimensional,
the algorithm terminates. Suppose that the loop goes on forever. Notice that at each step either
$F$ is extended by adding new linearly independent polynomials or it moves to $t + 1$. Since
the number of linearly independent polynomials added to $F$ in degree $\le 2t$ is finite, there is
a step in the loop from which $F$ is not modified any more in degree $\le 2t$. In this case, all
boundary C-polynomials of elements of $F$ of degree $\le 2t$ are reduced to $0$ by $F_{2t}$ and $F_{2t}$ is a
border basis for $B_{2t}$ in degree $\le 2t$. By Lemma 5.1 with $E = \mathbb{R}[x]_t$, $\Lambda^*$ extends to a linear form
$\tilde{\Lambda}^* \in \mathbb{R}[x]_{2t}^*$ such that

$K^t := \ker H^{t}_{\tilde{\Lambda}^*} = \ker H^{B_t}_{\Lambda^*} + \langle F \,|\, t \rangle$.

By Theorem 3.19, for $t \ge t_2$ we have $(K^t) = I_{min}(f)$ and $\Lambda^*(f) = f^*$. As $I_{min}(f)$ is zero-dimensional, for $t$ high enough, a border basis $F'$ of $K^t$ in degree $\le 2t$ is a border basis of
$(K^t) = I_{min}(f)$. Let $B'$ be the corresponding monomial basis. Then $r := |B'|$ is the number
of minimizers (i.e. the points in $V(I_{min}(f))$). By Lemma 5.1 with $E = \langle B'^+ \rangle$ and Theorem
4.5, any linear form $\Lambda' \in \mathcal{L}_{F',B',\succcurlyeq}$ generic for $f$ is the restriction of a positive linear form

$\tilde{\Lambda}' = \sum_{i=1}^{r} \lambda_i \mathbf{1}_{\zeta_i}$

with $\{\zeta_1, \ldots, \zeta_r\} = V(I_{min}(f))$, $\lambda_i > 0$ and $\sum_{i=1}^{r} \lambda_i = 1$. In this case, $\Lambda'(f) = \sum_{i=1}^{r} \lambda_i f(\zeta_i) = f^*$ and $\ker H^{B'}_{\Lambda'} = \{0\}$. We arrive at a contradiction, which shows that
the algorithm must stop in step (4), for some degree $t$.

By Lemma 5.2, $F'$ is a border basis of $I_{min}(f)$ with basis $B'$ and $\Lambda'(f) = f^*$. □

6. Examples 

This section contains examples which illustrate the behavior of the algorithm. In the first
example, the Motzkin polynomial, the gradient ideal is not zero-dimensional, whereas in
the second example, the Robinson polynomial, the gradient ideal is zero-dimensional. In all
the examples the minimizer ideal is zero-dimensional, hence our algorithm returns
the correct result in a finite number of steps.

The implementation of the previous algorithm has been performed using the BORDERBASIX
package of the MATHEMAGIX software. BORDERBASIX is a C++ implementation of the border
basis algorithm of [32].

Semidefinite positive Hankel operators are computed using the semidefinite programming
routine of the SDPA software. For the link with SDPA we use a file interface since SDPA is not
distributed as a library.

For the computation of the border basis, we use a choice function that is tolerant to numerical
instability, i.e. a choice function that chooses as leading monomial a monomial whose
coefficient is maximal among the choosable monomials. This, according to [22], makes the
border basis computation stable with respect to numerical perturbations. This property is
fundamental as we use results from SDP solvers to get new equations. As these equations
are computed numerically, the computation is subject to numerical errors.

Once the border basis of the minimizer is computed, the roots are obtained using the 
numerical routines described in [7]. 

Experiments are made on an Intel Core i7 2.30GHz with 8GB of RAM.

In the following examples, we use the notation $H^{t}_{\Lambda} = H^{B_t}_{\Lambda}$.

BORDERBASIX: www-sop.inria.fr/galaad/mmx/borderbasix
MATHEMAGIX: www.mathemagix.org
SDPA: sdpa.sourceforge.net

Example 6.1. We consider the Motzkin polynomial

$f(x, y) = 1 + x^4 y^2 + x^2 y^4 - 3 x^2 y^2$

which is non negative on $\mathbb{R}^2$ but not a sum of squares in $\mathbb{R}[x, y]$. We compute its gradient
ideal, $I_{grad}(f) = (-6xy^2 + 2xy^4 + 4x^3y^2,\ -6yx^2 + 2yx^4 + 4y^3x^2)$, which is not zero-dimensional.

• In the first iteration the degree is 3, the size of the Hankel matrix $H^{3}_{\Lambda}$ is 10, $\min \Lambda(f) = -216$, there is a duality gap, hence we try with degree 4.

• In the second iteration the degree is 4, the size of the Hankel matrix $H^{4}_{\Lambda}$ is 15, $\min \Lambda(f) = 0$, there is no duality gap. The minimum differs from the previous minimum,
so a new iteration is needed.

• In the third iteration the degree is 5, the size of the Hankel matrix $H^{5}_{\Lambda}$ is 19 and
$\min \Lambda(f) = 0$. The minimums are equal, hence we compute the kernel of $H_{\Lambda}$, which
is generated by 5 polynomials. We compute the border basis and obtain the basis
$B = \{1, x, y, xy\}$. All the elements of $B$ have degree $< 4$.

• We compute a generic form for $f$ with this border basis, the size of the Hankel matrix
$H_{\Lambda}$ is 4, $\min \Lambda(f) = 0$ and $\ker H_{\Lambda} = \{0\}$.

After the fourth iteration the algorithm stops and we obtain

(1) $I_{min} = (x^2 - 1,\ y^2 - 1)$.

(2) The basis $B = \{1, x, y, xy\}$.

(3) The points which minimize $f$: $\{(x = 1, y = 1), (x = 1, y = -1), (x = -1, y = 1), (x = -1, y = -1)\}$.

The complete process of resolution took 0.286s, of which 0.154s was spent computing SDP.
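
The output of this example can be checked independently of the algorithm. The following is a minimal sketch that evaluates the Motzkin polynomial, its gradient and the two reported generators at the four reported points, using exact integer arithmetic; the function names are chosen here for illustration only.

```python
def motzkin(x, y):
    return 1 + x**4 * y**2 + x**2 * y**4 - 3 * x**2 * y**2

def gradient(x, y):
    """Partial derivatives of the Motzkin polynomial with respect to x and y."""
    return (4 * x**3 * y**2 + 2 * x * y**4 - 6 * x * y**2,
            2 * x**4 * y + 4 * x**2 * y**3 - 6 * x**2 * y)

minimizers = [(1, 1), (1, -1), (-1, 1), (-1, -1)]
values = [motzkin(x, y) for x, y in minimizers]
grads = [gradient(x, y) for x, y in minimizers]
generators_vanish = all(x**2 - 1 == 0 and y**2 - 1 == 0 for x, y in minimizers)
print(values, grads, generators_vanish)
```

Each reported point is a critical point with value $0$, which agrees with the minimum found in the last iterations.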
Example 6.2. We consider the Robinson polynomial,

$f(x, y) = 1 + x^6 - x^4 - x^2 + y^6 - y^4 - y^2 - x^4 y^2 - x^2 y^4 + 3 x^2 y^2$

which is non negative on $\mathbb{R}^2$ but not a sum of squares in $\mathbb{R}[x, y]$. We compute its gradient
ideal $I_{grad}(f) = (6x^5 - 4x^3 - 2x - 4x^3y^2 - 2xy^4 + 6xy^2,\ 6y^5 - 4y^3 - 2y - 4y^3x^2 - 2yx^4 + 6yx^2)$,
which is zero-dimensional.

• In the first iteration the degree is 3, the size of the Hankel matrix $H^{3}_{\Lambda}$ is 10, $\min \Lambda(f) = -0.93$, there is no duality gap, hence we compare the minimum with degree 4.

• In the second iteration the degree is 4, the size of the Hankel matrix $H^{4}_{\Lambda}$ is 15,
$\min \Lambda(f) = 0$. As the minimums are different, another iteration is needed.

• In the third iteration the degree is 5, the size of the Hankel matrix $H^{5}_{\Lambda}$ is 19 and
$\min \Lambda(f) = 0$. The minimums are equal, hence we compute the kernel of $H_{\Lambda}$, which is
generated by 6 polynomials. We compute the border basis and we obtain
$B = \{1, x, y, x^2, xy, y^2, x^2y, xy^2, x^2y^2\}$. There exists an element of $B$ with degree $\ge 4$,
so we go to the next degree.

• We compute a generic form $\Lambda$ for $f$ in degree 6, the size of the Hankel matrix $H^{6}_{\Lambda}$ is
22 and $\min \Lambda(f) = 0$. The minimums are equal, hence we compute the kernel of $H^{5}_{\Lambda}$,
which is generated by 11 polynomials. We compute the border basis and obtain the
basis $B = \{1, x, y, x^2, xy, y^2, x^2y, xy^2\}$. All the elements of $B$ have degree $< 5$.

• We compute a generic form for $f$ with this border basis, the size of the Hankel matrix
$H_{\Lambda}$ is 8, $\min \Lambda(f) = 0$ and $\ker H_{\Lambda} = \{0\}$.

After the fourth iteration the algorithm stops and we obtain

(1) $I_{min} = (x^3 - x,\ y^3 - y,\ x^2y^2 - x^2 - y^2 + 1)$.

(2) The basis $B = \{1, x, y, x^2, xy, y^2, x^2y, xy^2\}$.

(3) The points which minimize $f$: $\{(x = 1, y = 1), (x = 1, y = -1), (x = -1, y = 1), (x = -1, y = -1), (x = 1, y = 0), (x = -1, y = 0), (x = 0, y = 1), (x = 0, y = -1)\}$.

The complete process of resolution took 0.315s, of which 0.134s was spent computing SDP.
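
As a cross-check of this example, the following minimal sketch enumerates the common real zeros of the three reported generators and evaluates the Robinson polynomial there. It relies on the observation that $x^3 - x$ and $y^3 - y$ force both coordinates to lie in $\{-1, 0, 1\}$; the variable names are chosen here for illustration only.

```python
from itertools import product

def robinson(x, y):
    return (1 + x**6 - x**4 - x**2 + y**6 - y**4 - y**2
            - x**4 * y**2 - x**2 * y**4 + 3 * x**2 * y**2)

generators = [
    lambda x, y: x**3 - x,
    lambda x, y: y**3 - y,
    lambda x, y: x**2 * y**2 - x**2 - y**2 + 1,
]

# Roots of t^3 - t are -1, 0, 1, so the common zeros lie on this 3 x 3 grid.
grid = list(product([-1, 0, 1], repeat=2))
zeros = [p for p in grid if all(g(*p) == 0 for g in generators)]
values = {p: robinson(*p) for p in zeros}
print(len(zeros), set(values.values()))
```

The third generator is what excludes the origin, where the polynomial takes the value $1$; the number of remaining points equals the size of the basis $B$.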

Example 6.3. We consider the polynomial,

$f(x, y) = -12x^3 + 3xy^2 + 4y^3 - 16x^2y + 48x^2 - 12y^2$

We compute its gradient ideal, $I_{grad}(f) = (-36x^2 + 3y^2 - 32xy + 96x,\ 6xy + 12y^2 - 16x^2 - 24y)$.

• In the first iteration the degree is 2, the size of the Hankel matrix $H^{2}_{\Lambda}$ is 4, $\min \Lambda(f) = -18.6$. The minimum differs from the previous minimum $-\infty$. Hence, a new
iteration is needed.

• In the second iteration the degree is 3, the size of the Hankel matrix $H^{3}_{\Lambda}$ is 4 and
$\min \Lambda(f) = -18.6$. The minimums are equal, hence we compute the kernel of $H_{\Lambda}$,
which is generated by 3 polynomials. We compute the border basis and we obtain
$B = \{1\}$. All the elements of $B$ have degree $< 2$.

• We compute a generic form for $f$ with this border basis, the size of the Hankel matrix
$H_{\Lambda}$ is 1, $\min \Lambda(f) = -18.6$ and $\ker H_{\Lambda} = \{0\}$.

After the second iteration the algorithm stops and we obtain

(1) $I_{min} = (x + 0.43636,\ y - 2.32727)$.

(2) The basis $B = \{1\}$.

(3) The point which minimizes $f$: $\{(x = -0.43636, y = 2.32727)\}$.

The complete process of resolution took 0.427s, of which 0.130s was spent computing SDP.
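
The numerical output of this example can be sanity-checked with a few floating-point evaluations. The following is a minimal sketch that evaluates $f$ and the two generators of the gradient ideal at the reported point. Since the coordinates are printed with five decimals only, the checks use loose tolerances, which are assumptions of this sketch.

```python
def f(x, y):
    return -12*x**3 + 3*x*y**2 + 4*y**3 - 16*x**2*y + 48*x**2 - 12*y**2

def grad(x, y):
    """The two generators of the gradient ideal given in the example."""
    return (-36*x**2 + 3*y**2 - 32*x*y + 96*x,
            6*x*y + 12*y**2 - 16*x**2 - 24*y)

x_star, y_star = -0.43636, 2.32727      # point reported in the example
value = f(x_star, y_star)
gx, gy = grad(x_star, y_star)
print(round(value, 1), abs(gx) < 1e-2, abs(gy) < 1e-2)
```

To the printed precision, the reported point is a critical point of $f$ and the value there agrees with the reported $-18.6$.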

Example 6.4. We consider the Leep and Starr polynomial,

$f(x, y) = 16 + x^2y^4 + 2x^2y^3 - 4x^3y^3 + 4xy^2 + 20x^2y^2 + 8x^3y^2 + 6x^4y^2 + 8xy - 16x^2y$

that is positive on $\mathbb{R}^2$ but cannot be written as a sum of squares in $\mathbb{R}[x, y]$. We compute its
gradient ideal,

$I_{grad}(f) = (2xy^4 + 4xy^3 - 12x^2y^3 + 4y^2 + 40xy^2 + 24x^2y^2 + 24x^3y^2 + 8y - 32xy,\ 4x^2y^3 + 6x^2y^2 - 12x^3y^2 + 8xy + 40x^2y + 16x^3y + 12x^4y + 8x - 16x^2)$

• In the first iteration the degree is 3, the size of the Hankel matrix $H^{3}_{\Lambda}$ is 10, $\min \Lambda(f) = -5.4$, there is a duality gap. We should try degree 4, but as the basis has changed, we try
again with degree 3.

• In the second iteration the degree is 3, the size of the Hankel matrix $H^{3}_{\Lambda}$ is 10, $\min \Lambda(f) = 0.6$, there is no duality gap. As the minimum differs from the minimum in
degree 2, a new iteration step is performed.

• In the third iteration the degree is 4, the size of the Hankel matrix $H^{4}_{\Lambda}$ is 11 (with
the reduction) and $\min \Lambda(f) = 0.6$. The minimums are equal, hence we compute the
kernel of $H^{3}_{\Lambda}$, which is generated by 9 polynomials. We compute the border basis and
obtain the basis $B = \{1\}$. All the elements of $B$ have degree $< 3$.

• We compute a generic form for $f$ with this border basis, the size of the Hankel matrix
$H_{\Lambda}$ is 1, $\min \Lambda(f) = 0.6$ and $\ker H_{\Lambda} = \{0\}$.

After the third iteration the algorithm stops and we obtain

(1) $I_{min} = (x + 3.3884,\ y - 0.14347)$.

(2) The basis $B = \{1\}$.

(3) The point which minimizes $f$: $\{(x = -3.3884, y = 0.14347)\}$.

The complete process of resolution took 0.417s, of which 0.130s was spent computing SDP.

The experiments suggest that, due to the small size of the matrices $H_{\Lambda}$, most of the resolution
time is spent in the classical border basis computation. We emphasize this all the more
since we used a file interface for communicating with SDPA, which is rather slow.